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OBJECT-ORIENTED  FORMAL  SPECIFICATION  USING  REFINE: 

A  FEASIBILITY  STUDY 


Barrett  R.  Bryant 
Associate  Professor 

Department  of  Computer  and  Information  Sciences 
The  University  of  Alabama  at  Birmingham 


Abstract 


This  paper  proposes  object-oriented  extensions  to  the  REFINE  specification  language 
which  will  facilitate  transformation  of  REFINE  specifications  into  C-f-f  programs.  The  ad¬ 
vantages  of  this  approach  are:  1)  existing  REFINE  specifications  may  be  enhanced  through 
interfaces  with  C-1-+  applications,  2)  an  integration  of  object-oriented,  logic  and  functional 
specification  methodologies  is  achieved,  and  3)  object-orientation  provides  support  for  paral¬ 
lel  execution  in  a  natural  manner  which  will  allow  REFINE  specifications  to  be  transformed 
into  parallel  programs  where  appropriate. 
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OBJECT-ORIENTED  FORMAL  SPECIFICATION  USING  REFINE: 

A  FEASIBILITY  STUDY 

Barrett  R.  Bryant 


1  Introduction 

The  Knowledge- Based  Software  Assistant  (KBSA)  [Whit91]  was  developed  at  Rome  Lab¬ 
oratory  to  automate  the  development  of  software  systems  throughout  their  entire  software 
life-cycle.  KBSA  is  unique  in  that  it  provides  knowledge-base  support  for  the  software 
development  process  undertaken  by  the  software  development  team.  The  specification  lan¬ 
guage  used  in  KBSA  is  REFINE”^*^  [Reas90],  developed  by  Reasoning  Systems,  Inc.,  which 
is  an  integration  of  set  theoretic  constructions,  logical  rules,  transformation  rules,  pattern 
matching,  and  procedural  structures  in  a  knowledge-base  system  framework.  REFINE  is 
executable  through  program  transformation  methodology  and  hence  allows  the  construction 
of  executable  specifications. 

Current  software  development  methodology  has  been  heavily  influenced  by  the  object- 
oriented  programming  paradigm  [Booc90].  This  paradigm  is  characterized  by  the  represen¬ 
tation  of  a  software  system  as  a  set  of  classes,  each  of  which  defines  an  abstract  data  type 
and  its  corresponding  set  of  operations.  Object-orientation  would  exist  in  the  specification, 
design,  and  implementation  phases  of  the  software  system’s  life  cycle.  Reuse  of  existing 
classes  for  a  wide  variety  of  applications  would  reduce  the  amount  of  time  required  for  soft¬ 
ware  validation  and  maintenance.  Unfortunately,  the  existing  version  of  REFINE  does  not 
support  object-orientation. 

In  this  paper,  we  propose  object-oriented  extensions  to  REFINE  to  facilitate  reuse 
at  the  specification  level.  These  extensions  will  be  modelled  after  C-f-t-  [Stro86]  for  the 
pragmatic  reason  that  it  is  the  most  widely  used  object-oriented  programming  language 
in  commercial  environments.  It  is  hoped  that  the  relationship  to  C-f- f-  will  facilitate  the 
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acceptance  of  object-oriented  REFINE  among  software  developers,  allow  REFINE  programs 
to  interface  with  the  large  number  of  existing  applications  developed  in  and  will 

enhance  C-|--|-  itself  by  providing  an  interface  with  an  executable  specification  language 
featuring  a  number  of  powerful  tools  not  currently  available  in  the  C-|--f  language.  The  new 
specification  language  will  be  upwardly  compatible  with  the  current  version  of  REFINE,  and 
is  especially  intended  to  preserve  properties  of  the  original  REFINE  language  which  allow 
reasoning  about  programs. 

The  paper  is  organized  as  follows.  Section  2  discusses  object-oriented  software  specifica¬ 
tion.  In  section  3,  we  evaluate  the  issues  involved  in  extending  REFINE  to  be  object-oriented. 
Automatic  parallelization  of  object-oriented  specifications  is  discussed  in  section  4.  Finally, 
we  conclude  with  section  5. 


2  Object-Oriented  Software  Specification 

Object-orientation  is  the  concept  of  modelling  real-world  entities  as  objects  and  character¬ 
izing  the  behavior  of  objects  through  the  definition  of  classes  [Wegn87].  The  fundamental 
properties  of  objects  belonging  to  a  certain  class  are  defined  by  methods  and  the  components 
of  objects,  called  instance  variables,  along  with  the  associated  methods  form  an  abstract  data 
type.  Since  the  details  of  the  various  methods  need  not  be  known,  only  their  behavior  being 
necessary,  objects  may  be  reasoned  about  abstractly  [Dahl87].  This  is  further  facilitated  by 
the  fact  that  the  names  of  instance  variables  and  methods  in  each  class  are  local  to  that 
class  and  hence  lexical  scoping  enforces  the  encapsulation  of  the  data  type.  A  further  funda¬ 
mental  property  of  object-orientation  is  the  notion  of  inheritance  whereby  a  class  is  denoted 
as  being  a  subclass  of  another  class  and  thereby  inherits  all  instance  variables  and  methods 
of  the  parent  class.  This  allows  for  further  specialization  of  object  behaviors.  Inheritance 
serves  to  structure  the  objects  into  members  of  a  class  hierarchy  which  is  well  founded  in 
type  theory  [Dahl90].  The  development  of  methods  into  implementations  may  conveniently 
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follow  stepwise  refinement  principles  [Lano92,  Saak92]. 

Current  efforts  at  object-oriented  specification  include  FOOPS  [Gogu87],  an  algebra¬ 
ic  specification  language  supporting  the  definition  of  classes  by  describing  the  algebraic 
properties  of  their  members,  and  object-oriented  extensions  to  the  Z  specification  language 
[Cusa91,  Lano91].  Some  of  the  difficulties  in  making  such  extensions  to  Z  are  the  need  for 
universal  and  existential  quantification  of  types  to  provide  polymorphism  as  well  as  high- 
order  operations  such  as  the  application  of  quantification  to  object  methods  [Whys90]. 

Since  objects  are  independent  entities  which  communicate  through  message  passing, 
concurrency  is  natural  to  object-orientedness  [Yone87].  There  are  many  forms  of  concur¬ 
rency  possible  within  an  object-oriented  system.  Inter-object  concurrency  refers  to  different 
objects  carrying  out  different  activities  at  the  same  time.  Intra-object  concurrency  refers 
to  a  single  object  executing  several  methods  simultaneously.  Each  of  these  methods  could 
themselves  be  carrying  out  several  operations  concurrently  [Nels91].  The  objects  have  differ¬ 
ent  threads  of  control  based  on  the  level  of  concurrency.  The  different  styles  of  interaction 
between  communicating  objects  can  be  classified  based  on  the  send-receive  mechanism,  call- 
reply  mechanism  and  synchronous  or  asynchronous  or  future  mechanisms  [Toml89].  The 
main  issues  in  such  designs  are  whether  message  passing  should  be  synchronous  or  asyn¬ 
chronous  or  be  in  the  form  of  a  remote  procedure  call,  and  the  granularity  of  parallelism: 
fine-grained  (instruction-level),  medium-grained  (loop  parallelism),  or  course-grained  (object 
method  level). 

The  various  efforts  towards  concurrent  object-oriented  programming  languages  have 
been  in  extending  existing  object-oriented  languages  to  support  concurrency  or  developing 
new  languages  [Nels91].  For  example.  Concurrent  C-t--|-  [Geha88]  is  one  extension  of  C-I-+. 
The  actor  model  has  been  incorporated  into  C-l--t-  in  [Kafu90].  These  various  concurrent 
object-oriented  programming  systems  provide  some  explicit  concurrency  constructs  which 
the  programmer  has  to  use  to  obtain  concurrency.  In  a  concurrent  object-oriented  program¬ 
ming  system,  the  specification  of  synchronization  constraints  and  its  relationship  with  en- 


1-5 


capsulation  is  important;  and  also  the  relationship  between  synchronization  constraints  and 
inheritance  [Nier90].  The  different  systems  realize  concurrency  over  a  shared  or  distributed 
platform  [Bal89,  Chin91]. 

We  believe  that  concurrency  can  also  be  conveniently  added  to  object-oriented  speci¬ 
fication.  However,  the  proposed  object-oriented  REFINE  language  has  a  major  advantage 
over  standard  C+-f-  with  respect  to  concurrency.  The  features  of  the  language  are  sufficiently 
high-level  that  they  may  contain  implicit  parallelism  which  may  be  detected  by  the  specifi¬ 
cation  language  compiler  and  this  parallelism  may  then  be  explicitly  incorporated  into  the 
generated  code.  For  example,  many  of  the  automatic  parallelization  techniques  described  in 
[Zima90]  would  be  applicable.  Furthermore,  the  exploitation  of  parallelism  in  object-oriented 
languages  is  based  upon  formal  methods  [Amer91]. 


3  Extending  REFINE  as  a  Specification  Language 

REFINE  was  designed  with  the  following  goals  in  mind  [Reas90]:  1)  development  of  the 
specification  is  accompanied  by  a  knowledge-base  which  stores  details  of  the  specification 
development  process,  2)  both  “wide-spectrum”  and  “multi-paradigm”  programming  are  sup¬ 
ported,  3)  the  language  should  be  extensible,  and  4)  there  should  be  language  support  for 
stepwise  refinement  including  representations  for  programs  and  transformation  rules.  We 
believe  that  each  of  these  goals  may  be  accomplished  within  the  proposed  C-1-+  framework. 
The  general  strategy  is  to  assume  a  preprocessor  for  translating  REFINE  syntax  into  the 
corresponding  C+-f  syntax.  REFINE  semantics  will  be  defined  using  a  collection  of  C+-t- 
classes,  meaning  that  for  every  REFINE  operation  on  a  data  type,  there  will  be  a  corre¬ 
sponding  C-f  +  method  for  that  data  type,  defined  as  a  class.  In  the  following  subsections, 
we  address  the  different  aspects  of  the  REFINE  language  in  turn,  discuss  how  it  may  be 
transformed  into  C-f- f ,  and  identify  where  parallelism  might  be  exploited. 


1-6 


3.1  Data  Types 


REFINE  provides  data  types  for  scalar  objects  including  numbers,  characters,  symbols,  and 
Booleans,  structured  objects  such  as  sets,  sequences,  strings  and  tuples,  as  well  as  binary 
relations  and  maps.  Undefined  and  typeless  values  are  also  represented  and  a  number  of 
data  type  conversions  are  provided. 


3.1.1  Scalar  Objects 

Numbers,  characters,  symbols  and  Booleans,  and  most  of  their  associated  operations  may 
be  trivially  defined  using  C++.  This  is  true  also  of  universal  and  existential  quantification 
and  nondeterminism  because  of  the  limitations  REFINE  places  on  the  bound  variables  to 
ensure  finiteness.  These  may  be  implemented  using  an  iterator  construction  in  C++.  In  a 
parallel  implementation,  such  quantifications  ranging  over  a  large  number  of  values  may  also 
be  considered  for  possible  partitioning  into  different  computational  threads. 

For  an  example  of  representing  a  REFINE  quantification  in  C++,  consider  the  specifi¬ 
cation; 

ex  (x,  y)  (y  *  X  =  420000  &  x  in  [1  ..  1000]  &  y  in  [1  ..  1000]) 

which  evaluates  to  true  because  x  =  600  and  y  =  700  is  one  such  solution.  A  corresponding 
C++  iteration  is: 

for  (x  =  1;  (x  <=  1000)  &&  ! exists;  x++) 

for  (y  =  1;  (y  <=  1000)  &&  ! exists;  y++) 
exists  =  y  ♦  X  ==  420000; 


Using  loop  parallelization  transformations,  this  could  be  transformed  into  equivalent  code 
which  executes  all  iterations  of  the  outer  loop  in  parallel,  as  in: 

doall  (x  =  1;  x  <=  1000;  x++) 

for  (y  =  1;  (y  <=  1000)  &&  Ixexists  [x] ;  y++) 
xexists  [x]  =  y  *  X  ==  420000; 
exists  =  ainy  (xexists) ; 
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The  results  of  each  iteration  of  x  are  tallied  in  the  xexists  array.  A  standard  function  any 
is  then  used  to  verify  if  any  element  of  the  xexists  array  is  true.  This  function  may  be 
implemented  in  parallel  as  well. 

3.1.2  Sets,  Sequences,  Strings  and  Tuples 

As  is  the  case  with  scalar  objects,  these  structured  objects  and  most  of  their  operations  are 
easily  defined  using  C-|-+.  Sets  may  be  implemented  using  either  bit  vectors  for  dense  sets  or 
hash  tables  for  sparse  sets.  Sequences  may  be  represented  using  list  data  structures.  Strings 
are  directly  representable  as  well.  Tuples  correspond  to  C++  structures. 

Similar  to  Boolean  quantification,  constructions  such  as  the  set  former  and  sequence 
former  and  operations  based  on  these  may  be  implemented  using  iterators  with  parallel  ex¬ 
ecution  feasible  for  very  large  sets  or  sequences.  Complexities  are  introduced  by  set  and 
sequence  operations  which  require  references  to  functions,  such  as  filter,  image,  and  re¬ 
duce.  In  the  case  where  function  names  are  passed  as  arguments  to  these  operations,  the 
“values”  of  these  arguments  may  be  pointers  to  the  appropriate  C++  functions.  REFINE 
also  allows  functions  to  be  represented  by  lambda  abstractions  which  are  not  directly  ex¬ 
ecutable  in  C++.  During  the  course  of  the  preprocessing  from  REFINE  into  C++,  some 
of  these  abstractions  may  be  converted  into  standard  functions  in  cases  where  the  lambda 
abstraction  is  applied  to  exactly  the  correct  number  of  arguments.  However,  if  the  number 
of  arguments  is  different  from  the  number  of  variables  bound  by  the  lambda  abstraction, 
then  some  additional  techniques  need  to  be  used.  One  possibility  is  to  convert  the  resulting 
function  expressions  into  supercombinators  [Peyt87].  In  this  approach,  lambda  abstractions 
are  translated  into  standard  functions  which  have  exactly  the  required  number  of  formal 
parameters  to  match  the  number  of  arguments,  either  by  adding  additional  formal  param¬ 
eters  or  by  recognizing  that  the  result  of  the  function  may  be  another  function  and  hence 
require  further  applications.  Supercombinator  generation  may  not  always  be  possible  during 
preprocessing  time  in  which  case  there  must  exist  methods  in  lambda  abstraction  classes  to 
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generate  the  supercombinator  expressions  dynamically.  This  will  be  facilitated  by  defining  a 
class  for  functions  and  manipulating  functions,  including  lambda  abstractions,  as  first-class 
objects. 

As  an  example  of  where  parallelism  may  occur,  consider  the  image  of  a  sequence  under 
a  map,  image  (f,  S),  where  f  is  a  mapping  function  and  S  is  a  sequence.  Represented  in 
C-b- b,  this  might  be  treated  as  a  message  image  (f)  being  sent  to  S  (i.e.  S  .  image  (f)).  The 
C-b+  code  to  implement  this  must  traverse  the  sequence  S,  applying  the  function  f  to  each 
member  of  S.  If  S  is  represented  as  a  structure  with  a  length  attribute  and  the  individual 
elements  are  stored  in  an  array,  then  the  code  might  be  as  follows: 

for  (i  =  0;  i  <  length;  i++) 

new_sequence  .  element  [i]  =  f  (element  [i] ) ; 

Now  the  code  may  be  parallelized  to  execute  each  iteration  of  the  loop  in  parallel.  Similar 
parallelizing  transformations  may  be  applied  to  the  reduce  and  filter  operations. 

Sets,  sequences  and  tuples  have  pattern  and  elliptical  constructions.  During  prepro¬ 
cessing,  such  constructions  can  be  converted  into  more  basic  operations  using  the  standard 
REFINE  semantics. 

3.1.3  Binary  Relations  and  Maps 

Binary  relations  are  essentially  implemented  in  REFINE  as  sets.  Hence,  set  operations 
cover  binary  relation  operations  with  the  exception  of  transitive  closure.  Maps  are  function 
objects  which  have  been  described  somewhat  in  the  previous  section.  As  mentioned  there, 
difficulties  arise  in  manipulating  functions  that  must  be  represented  dynamically.  Either 
supercombinator  reduction  or  graph  reduction  may  be  implemented  in  the  C-b+  function 
class  to  allow  function  operations  to  proceed  with  good  efficiency.  Functions  are  among  the 
least  natural  REFINE  structures  to  represent  using  C-b+  since  there  is  no  existing  support 
for  higher-order  functions.  However,  cis  mentioned  earlier,  a  class  can  be  constructed  to 
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represent  such  functions  which  would  then  allow  higher-order  functions  to  manipulate  C-|  h 
objects.  This  could  become  the  framework  of  a  powerful  integration  of  functional  and  object- 
oriented  programming  methodologies. 

3.1.4  Knowledge-Base  Objects 

REFINE  provides  full  support  for  the  construction  of  an  object-oriented  knowledge-base 
system.  This  means  that  objects  may  be  created  which  represent  some  knowledge  about 
the  specification  development  process.  Furthermore,  these  objects  form  a  hierarchy  thereby 
facilitating  the  reuse  of  components.  The  REFINE  “knowledge-base "  may  be  represented  by 
a  global  object  in  C-| — h  capable  of  storing  and  manipulating  the  tree-structured  hierarchy 
in  response  to  REFINE  statements.  In  a  sense,  the  maintenance  of  this  knowledge-base 
is  an  object-oriented  system  embedded  within  and  implemented  by  a  object-oriented 

program. 

3.2  Control  Constructs 

Many  control  constructs  found  in  REFINE  are  similar  to  those  in  most  conventional  lan¬ 
guages,  including  C-1-+.  Here  we  concentrate  on  the  unusual  ones.  The  translations  of 
assignment  statements,  statement  sequencing,  conditional  statements,  enumeration,  while 
statements,  and  function  definitions  are  straightforward.  Some  parallelism  is  possible  during 
enumeration  and  while-iteration  as  analysis  might  allow  for  the  bodies  of  these  statements 
to  be  executed  in  parallel  across  different  iterations. 

3.2.1  Variables 

A  let  statement  is  used  to  introduce  bindings  for  local  variables.  The  scoping  rules  of  let 
statements  are  similar  to  C-| — 1-  blocks  so  a  one-to-one  translation  between  the  two  would  be 
the  simplest  approach.  This  would  not  be  possible  if  there  were  a  “parallel”  assignment  of 
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the  form: 


let  (x  =  1,  y  =  2) 

let  (x  =  y,  y  =  x) 

In  this  case,  the  innermost  x  is  to  receive  the  value  2  and  the  innermost  y  the  value  1.  The 
simplest  solution  to  this  is  to  substitute  the  values  assigned  to  the  outer  variables  for  those 
variables  within  the  parallel  assignment  so  as  to  break  the  parallelism.  As  long  as  there  is  an 
order  to  the  variable  assignments,  this  order  may  be  followed  in  the  C++  block  in  making 
the  assignments.  For  example,  if  we  have: 

let  (x  =  1) 

let  (x  =  2,  y  =  x) 

we  may  always  translate  this  into: 

{  int  X  =  1 

{  int  y  =  x;  int  x  =  2; 

} 

} 

where  the  initializations  are  performed  in  order.  In  performing  this  translation,  some  opti¬ 
mizations  of  the  let-block  may  also  be  done  to  eliminate  unnecessary  variable  assignment  and 
redundant  computation  of  common  subexpressions.  Note  that  a  type  analysis  is  required  of 
the  values  being  assigned  to  let-variables  so  that  the  appropriate  declarations  in  C++  can 
be  made. 

3.3  Knowledge-Base  Object  Replacement  and  Transforms 

As  mentioned  earlier,  the  REFINE  knowledge-base  should  be  accessible  through  a  global 
object  with  methods  defined  for  knowledge-base  operations.  One  such  operation  is  the  re¬ 
place  statement  which  replaces  one  knowledge-base  object  by  another.  We  assume  that  the 
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knowledge-base  class  has  been  defined  with  appropriate  methods  to  perform  such  replace¬ 
ments. 

Transform  statements  are  one  of  the  most  unique  aspects  of  the  REFINE  language 
in  that  they  provide  rules  through  which  the  underlying  knowledge-base  should  be  updated. 
In  effect,  they  may  be  used  to  specify  the  replacement  of  program  constructs  by  other 
program  constructs.  In  another  sense,  transformation  rules  may  be  thought  of  as  declarative 
interpretations  of  program  constructions.  The  implementation  of  transforms  in  C-1-+  is 
similar  to  that  of  replacements  except  that  it  happens  at  the  global  knowledge-base  object 
level  rather  than  in  a  particular  specified  object.  There  are  a  number  of  complicated  semantic 
rules  pertaining  to  the  implementation  of  transforms  but  all  may  be  expressed  through 
appropriate  C-f-t-  methods. 

REFINE  permits  a  form  of  declarative  logic  programming  through  the  presence  of  as¬ 
sertion  rules.  They  are  typically  used  within  functions  to  give  a  declarative  specification  of 
the  function  semantics.  These  assertion  rules  may  then  be  used  by  the  REFINE  compiler 
to  produce  program  code  for  that  function.  The  methods  used  to  manipulate  the  global 
knowledge-base  may  be  used  to  interpret  logical  cissertions  as  well.  These  would  include  an 
extensive  array  of  theorem-proving  axioms  to  symbolically  manipulate  or  verify  the  asser¬ 
tions. 


3.4  C++  Infrastructure 

The  target  code  of  the  preprocessor  we  have  been  describing  will  be  embedded  within  an 
established  C-f -|-  infrastructure  of  existing  clcisses  and  their  associated  methods.  We  envision 
a  multiple-inheritance  class  hierarchy  providing  a  wide  range  of  abstract  data  types  for  the 
target  program  to  draw  upon.  As  has  been  seen  from  the  discussion,  this  will  include  classes 
for  manipulating  functions  as  first-class  objects,  classes  for  evaluating  declarative  specifica¬ 
tions  of  logical  rules,  and  classes  for  maintaining  a  separate  object-oriented  knowledge-base. 
In  this  sense,  the  implementation  combines  implementation  technologies  for  functional,  log- 
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ic,  and  object-oriented  programming  paradigms.  We  envision  that  REFINE  statements  will 
be  directly  embedded  into  C-)-- (-  functions  and  class  definitions  and  the  preprocessor  will 
distinguish  the  REFINE  code  and  make  the  appropriate  translations.  This  embedding  of 
REFINE  into  C-f-f-  will  allow  the  REFINE  specification  being  developed  to  interface  with 
existing  C+-f  classes,  thereby  facilitating  reuse  of  developed  code  at  the  specification  level. 
Furthermore,  it  serves  to  extend  C-t— 1-  with  the  very  advanced  and  powerful  features  of  the 
REFINE  language,  meaning  that  applications  under  development  in  C+-1-  can  be  “coded” 
at  the  specification  level  rather  than  the  program  level,  with  the  preprocessor  and  associated 
transformation  rules  in  REFINE  doing  the  program  level  coding. 

In  addition  to  implementing  the  features  of  REFINE  which  will  be  translated,  this 
underlying  C+-f  infrastructure  also  introduces  some  additional  semantic  complexities.  These 
are  enumerated  by  [Budd91]  as  follows; 

1.  C++  does  not  maintain  complete  run-time  information  about  the  types  of  objects. 
On  the  other  hand,  objects  are  often  assigned  values  through  pointers.  An  erroneous 
assignment  of  an  object  reference  may  not  be  detected  by  the  C++  compiler.  In 
order  to  be  used  as  a  specification  language,  this  situation  must  be  avoided  in  the 
REFINE/C++  interface. 

2.  In  multiple  inheritance,  the  same  name  may  be  inherited  from  two  different  parents  and 
hence  have  different  semantics.  This  may  be  solved  using  redefinition  which  overrides 
the  ambiguous  methods  or  renaming  which  changes  the  name  of  the  method.  However, 
the  system  must  ensure  that  the  ambiguity  is  resolved. 

3.  If  a  child  has  two  parents,  each  of  which  has  a  common  parent,  then  the  child  inherits 
from  the  grandparent  along  two  paths,  creating  ambiguity.  This  may  be  solved  by 
declaring  the  inheritance  of  the  parent  classes  from  the  grandparent  classes  as  virtual. 
Again,  the  system  should  ensure  that  this  is  done. 

4.  When  an  overloaded  name  is  used  in  C++,  the  inheritance  mechanism  is  first  used  to 
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find  a  name  scope  in  which  the  function  is  defined.  Parameter  types  are  then  used  to 
disambiguate  the  function  name  within  that  scope.  Sometimes  the  ambiguity  cannot 
be  resolved  due  to  the  types  being  convertible  (e.g.,  an  integer  may  be  converted  into 
a  floating  point  type). 


4  Automatic  Pcirallelization  of  Object-Oriented  Spec¬ 
ifications 

A  major  advantage  of  C+4-  (or  the  object-oriented  framework  in  general)  is  the  fact  that 
there  are  a  number  of  techniques  for  mapping  the  interacting  objects  into  parallel  and  dis¬ 
tributed  execution  models.  We  believe  that  a  good  parallelizing  compiler  can  detect  inter¬ 
dependencies  among  objects  and  map  them  onto  the  appropriate  processors  so  as  to  have 
effective  performance  enhancement. 

The  basic  structure  of  a  parallelizing  compiler  for  object-oriented  languages  would  be 
similar  to  that  for  more  conventional  languages  (e.g.  see  [Zima90]).  However,  there  are 
some  major  differences  in  the  nature  of  the  components,  caused  by  object-orientation.  The 
primary  components  of  the  parallelization  of  object-oriented  programs  are  flow  analysis, 
dependence  analysis,  and  parallel  code  generation. 

Flow  analysis  begins  with  intermediate  code  structures  produced  by  a  suitable  front-end 
and  constructs  a  flow  graph  showing  the  flow  of  control  throughout  individual  methods,  a 
collection  of  methods  within  a  class  or  object,  and  all  the  methods  of  a  program.  While 
there  are  similarities  between  this  flow  analysis  and  that  for  conventional  programming 
language  compilers,  it  is  made  considerably  more  complex  by  the  presence  of  polymorphism 
and  multiple  inheritance.  The  exact  details  for  solving  these  problems  are  currently  under 
investigation. 

Dependence  analysis  uses  the  control  flow  information  to  determine  all  the  data  de- 
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pendencies  between  the  different  objects.  The  primary  dependencies  in  an  object-oriented 
program  result  from  message  passing  among  objects.  To  determine  the  degree  of  intercon¬ 
nectedness  between  the  different  objects  and  the  flow  of  data  between  different  objects,  we 
may  construct  a  message  flow  graph.  We  may  also  construct  a  standard  dependence  graph 
for  the  code  within  each  method. 

Parallel  code  generation  will  use  the  flow  and  dependence  information  to  partition  the 
objects  into  processes  so  as  to  reduce  communication  costs.  This  includes  the  determination 
of  whether  an  object  process  should  be  synchronous  or  asynchronous.  Furthermore,  we  may 
generate  code  with  the  necessary  process  model  calls  to  automatically  control  communication 
between  processes  so  as  to  reduce  the  number  of  messages  that  must  be  sent  while  increasing 
the  amount  of  data  transferred  in  a  single  message.  This  would  include,  for  example,  the 
restructuring  of  iterative  constructions  involving  message  passing  of  individual  data  structure 
elements  (e.g.  sequences)  to  instead  send  a  single  message  of  the  entire  structure  to  a  newly 
created  method  which  then  iterates  over  the  structure. 

The  design  of  the  REFINE  language  itself  facilitates  such  automatic  parallelization. 
There  are  no  pointers  or  call  by  reference  to  create  aliasing  in  method  calls  and  all  object 
modification  may  therefore  be  done  explicitly.  The  fact  that  most  computation  takes  place 
using  itertive  constructions  would  allow  the  compiler  to  take  advantage  of  the  many  existing 
parallelization  techniques  that  have  been  developed  for  such  a  control  structure. 


5  Conclusions 

We  have  presented  an  analysis  of  the  feasibility  of  translating  REFINE  into  C-t--f  to  support 
object-oriented  specification  of  software  systems  under  the  C-f-f  model.  The  project  is 
generally  feasible  with  several  main  tasks  required  to  accomplish  the  implementation: 

1.  C-|— |-  classes  must  be  deflned  for  all  existing  REFINE  data  structures.  Many  of  these 
are  straightforward  with  the  exception  of  those  outlined  below. 
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2.  REFINE  is  currently  implemented  in  and  translated  into  Lisp.  There  are  a  number  of 
functional  language  features  supported  by  REFINE/Lisp  that  are  not  directly  prc-ent 
in  C++,  including  the  ability  to  manipulate  functions  as  first-class  objects  by  passing 
them  as  arguments  to  various  other  functions.  The  inclusion  of  lambda  abstractions 
is  an  embodiment  of  this  idea.  A  C++  class  for  functions  must  be  defined  with  an 
extensive  set  of  methods  to  support  the  neeoed  operations.  The  task  is  somewhat 
analogous  to  constructing  a  small  Lisp  interpreter  in  C++. 

3.  There  are  assertions  and  transformation  rules  present  in  REFINE  which  specify  in¬ 
formation  declaratively.  C-| — |-  does  not  have  an  inherent  capability  to  manipulate 
declarative  information  so  a  class  would  need  to  be  added  for  logical  rules  with  meth¬ 
ods  included  for  the  associated  proof  theory.  The  construction  of  such  a  class  would 
be  relatively  complex  and  essentially  embeds  a  Prolog  interpreter  into  C++. 

4.  REFINE  maintains  a  separate  object-oriented  knowledge-base  of  information  about 
the  development  of  the  specification.  Some  logical  aspects  of  this  knowledge-base  may 
be  implemented  using  the  logical  rule  class  described  previously.  However,  a  class 
hierarchy  must  be  maintained  within  the  program  and  the  implementation  of  this  will 
require  additional  classes. 

5.  There  are  several  aspects  of  C++  that  hinder  its  effectiveness  as  a  specification  lan¬ 
guage.  These  have  to  do  mainly  with  ambiguities  caused  by  multiple  inheritance  and 
polymorphism.  The  specification  environment  must  ensure  that  such  ambiguities  are 
not  created  in  the  specification  or  that  they  are  effectively  resolved  prior  to  invoking 
the  C++  compiler. 

There  are  a  number  of  places  within  the  existing  REFINE  language  where  parallelism 
may  be  detected  and  exploited  in  the  target  program.  However,  most  parallelism  will  likely 
come  from  the  interaction  of  the  objects  themselves  as  defined  in  C++. 
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Our  future  work  is  to  develop  a  complete  implementation  of  object-oriented  extensions 
to  REFINE  described  in  this  paper.  The  result  will  be  a  translation  from  REFINE  to 
which  allows  REFINE  specifications  to  interface  with  C-f-f  classes  and  objects.  Furthermore, 
we  will  develop  parallelizing  transformations  which  allow  the  resulting  C-|--f  programs  to  be 
executed  in  parallel  under  both  shared-memory  and  distributed  models  of  computations. 
This  will  allow  not  only  automatic  program  generation  from  specifications  but  automatic 
parallel  program  generation  from  specifications. 
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Abstract 


The  modern  command  center  must  support  force-level  data  fusion  (the  integration  and 
display  of  information  from  a  range  of  sources)  to  meet  the  needs  of  contingency  operations. 
Furthermore,  the  effectiveness  of  the  information  use  depends  upon  improving  the  integra¬ 
tion  of  information  processing  capabilities  with  the  cognitively-demanding  team  planning 
and  decision  making  tasks  of  the  joint  force  senior  command.  Thus,  “having  the  most  data” 
is  not  an  assurance  of  success  in  warfare.  The  leveraging  factors  in  information  technology 
for  command  support  are  the  ability  to  present  the  commander  with  the  right  information 
at  the  right  time  with  the  right  level  of  detail.  The  user-computer  interaction  (UCI)  design 
goal  is  an  information  interaction  concept  that  balances  maximal  use  of  the  positive  charac¬ 
teristics  of  the  various  information  presentation  and  interaction  technologies  to  meet  re¬ 
quirements,  while  minimizing  the  potential  for  distractions  from  the  primary  tasks. 
Achieving  this  balance  requires  understanding  not  only  the  cognitive  requirements  of  the 
task  and  the  characteristics  of  the  individual  UCI  technologies,  but  also  the  interaction  of 
technologies  and  the  additional  cognitive  requirements  imposed  by  the  introduction  of  new 
technological  capabilities  into  the  operational  environment.  One  of  the  principal  goals  of 
cognitive  systems  engineering  research  is  the  identification  and  exploration  of  these  rela¬ 
tionships.  Ultimately,  this  research  should  not  only  guide  the  design  of  new  systems,  but 
also  provide  feedback  to  drive  the  development  of  the  next  generation  of  technological  re¬ 
sources.  In  direct  contrast  to  the  tendency  for  “technology  push”  in  advanced  systems  de¬ 
velopment,  this  feedback  loop  supports  requirements-driven  design  and  reinforces  the  links 
to  the  human  user. 

This  report  investigates  the  “Data  Wall”  information  presentation  concept  and  its  as¬ 
sociated  interface  technologies  with  respect  to  their  potential  contributions  to  the  critical 
requirements  associated  with  theater-level  command  and  control  (C^)  decision  making.  The 
paper  further  highlights  the  pivotal  technological  issues  and  emphasizes  the  importance  of 
context-specific  research  to  support  successful  design  and  development  to  meet  the 
operational  requirements  of  decision  making. 
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INTEGRATING  MULTIMEDIA  INFORMATION  FOR  THEATER-LEVEL  COM¬ 
MAND  AND  CONTROL 

1.  Introduction 

1.1.  THE  CHANGING  CONTEXT  OF  MODERN  WARFARE 

In  recent  years,  the  rapid  evolution  of  both  the  international  political  environment  and 
innovations  in  technology  have  combined  to  produce  dramatic  changes  in  the  context  of 
modem  warfare.  In  addition  to  the  possibility  of  conventional  modes  of  warfare  with  well- 
understood  adversaries,  military  forces  must  now  prepare  for  contingency  operations  in¬ 
volving  limited-objective  or  low-intensity  warfare  in  relatively  novel  situations.  These  new 
modes  of  warfare  are  characterized  by  shorter  warning  times,  greater  ambiguity  and  a  re¬ 
quirement  to  plan  and  execute  responses  in  a  greatly  reduced  time  frame.  To  meet  the 
range  of  operational  requirements,  the  quest  for  and  capability  of  acquiring  more  and  better 
intelligence  on  evolving  situations  has  resulted  in  a  flood  of  complex  data  that  often  over¬ 
whelms  rather  than  informs.  Information  overload  is  further  compounded  by  the  require¬ 
ment  for  shorter  decision  cycles  due  to  dramatic  increases  in  speed  and  accuracy  of  weapon 
systems  . 

Current  doctrine  from  the  Joint  Chiefs  of  Staff  (JCS)  on  the  employment  of  US  Armed 
Forces  emphasizes  the  role  of  joint  (multi-service)  forces  in  modem  warfare  (JCS,  Nov. 

1991).  Additionally,  recent  operations  in  Southwest  Asia,  SomaHa,  and  Bosnia  have  in¬ 
volved  combined  (mvdtinational)  forces.  Both  operational  modes  of  warfare  require  the  ca¬ 
pability  to  rapidly  deploy  contingency  forces  into  regional  conflicts,  often  without  the  benefit 
of  previously  deployed  forward  bases  of  operation.  These  factors  impose  additional  require¬ 
ments  to  quickly  assemble  a  focused,  integrated  operational  picture  for  situation  assess¬ 
ment  and  planning.  The  information  technology  aspects  of  rapid  response  are  echoed  in  the 
JCS  mandate  for  “flexible  and  adaptive  decision  support  for  modular,  highly  mobile 
contingency  forces”  (JCS,  Oct.  1991). 

Emerging  in  these  models  is  the  notion  of  effective  and  efficient  command  and  control 
process  as  a  “force  multiplier”  with  the  potential  to  impact  battle  outcome  as  significantly 
as  advanced  weaponry.  However,  Coakley  (1991)  points  out  that  the  potential  of  C^  process 
as  a  force  multiplier  is  dependent  upon  the  organization’s  ability  to  employ  C^  technology 
effectively.  Thus,  “having  the  most  data”  is  not  an  assurance  of  success  in  warfare.  The 
leveraging  factors  in  information  technology  for  command  support  are  the  abdity  to  present 
the  commander  with  the  right  information  at  the  right  time  with  the  right  level  of  detail. 
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1.2.  LEVERAGING  INFORMATION  PRESENTATION  TECHNOLOGY  TO  SUPPORT  COMMAND  AND 

CONTROL 

The  modem  command  center  must  support  force  level  data  fusion  (the  integration  and 
display  of  information  from  a  range  of  sources)  to  meet  the  needs  of  contingency  operations. 
The  joint,  interoperable  architecture  supporting  “all  media  transmission  systems  —  voice, 
data,  message,  and  video,  as  well  as  imagery”  currently  envisioned  by  the  C‘^1  for  the  War¬ 
rior  Program  carries  with  it  the  implicit  requirement  to  effectively  integrate  and  display  in¬ 
formation  drawn  from  these  sources.  Furthermore,  the  effectiveness  of  the  information  use 
depends  upon  improving  the  integration  of  information  processing  capabilities  with  the 
cognitively-demanding  team  planning  and  decision  making  tasks  of  the  joint  force  senior 
command.  The  US  Air  Force’s  Air  Combat  Command  is  currently  pursuing  a  layered  ap¬ 
proach  to  information  presentation  whereby  battle  information  presented  at  the  senior 
command  level  is  generally  at  a  high  level  with  minimal  detail,  tailorable  to  match  decision 
requirements  and  individual  preference  (USAF  HQ  ACC,  1993).  This  approach  is  intended 
to  promote  increased  access  to  and  use  of  time-sensitive  information  by  allowing  the  com¬ 
mander  to  focus  on  (or  “pull”)  detailed  information  only  when  required  for  the  current  deci¬ 
sion.  In  this  model,  information  is  “pushed”  to  the  commander  based  on  its  value  in  main¬ 
taining  situational  awareness  and  relevancy  to  the  operational  decisions  rather  than  data 
availability. 

In  response  to  these  changes,  C^  systems  researchers  are  exploring  innovative  con¬ 
cepts  in  all  aspects  of  user-computer  interaction  (UCI)  and  information  presentation.  New 
developments  in  computer  architectures  have  made  technologically  and  economically  feasi¬ 
ble  a  variety  of  UCI  approaches  previously  thought  exotic  (e.g.,  natural  language  interfaces 
and  multimedia  information  display).  Enthusiasm  for  these  new  technologies  can  result  in 
their  premature  introduction  into  systems  without  a  clear  understanding  of  the  potential 
impacts.  On  the  other  hand,  certain  technologies  may  be  wnder -exploited  when  designers 
are  unaware  of  their  potential  advantages.  The  commumcation  of  information  is  not  a  neu¬ 
tral  transfer;  the  display  and  interaction  design  employed  in  any  system  transforms  infor¬ 
mation,  emphasizing  certain  aspects  and  de-emphasizing  others.  The  user-computer  inter¬ 
action  (UCI)  design  goal  is  an  information  interaction  concept  that  balances  maximal  use  of 
the  positive  characteristics  of  the  various  technologies  to  meet  requirements,  while  mini¬ 
mizing  the  potential  for  distractions  from  the  primary  tasks.  Achieving  this  balance  re¬ 
quires  understanding  not  only  the  cognitive  requirements  of  the  task  and  the  characteris¬ 
tics  of  the  individual  UCI  technologies,  but  also  the  interaction  of  technologies  and  the  addi¬ 
tional  cognitive  requirements  imposed  by  the  introduction  of  new  technological  capabilities 
into  the  operational  environment. 
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Cognitive  systems  engineering  involves  the  application  of  research  findings,  drawn 
from  the  fields  of  cognitive  science  and  engineering,  to  guide  the  matching  of  user,  task,  and 
organizational-environmental  requirements  to  available  tools  and  techniques  for  the  design 
of  interaction.  Thus,  one  of  the  principal  goals  of  cognitive  systems  engineering  research  is 
the  identification  and  exploration  of  these  relationships.  Ultimately,  this  research  should 
not  only  guide  the  design  of  new  systems,  but  also  provide  feedback  to  drive  the  develop¬ 
ment  of  the  next  generation  of  technological  resources  [Figure  1].  In  direct  contrast  to  the 
tendency  for  “technology  push”  in  advanced  systems  development,  this  feedback  loop  sup¬ 
ports  requirements-driven  design  and  reinforces  the  links  to  the  human  user. 


Figure  1:  Interaction  of  Task  Requirements  and  UCI  Technology 


This  report  investigates  the  “Data  Wall”  information  presentation  concept  and  its  as¬ 
sociated  interface  technologies  with  respect  to  their  potential  contributions  to  the  critical 
requirements  associated  with  theater-level  command  and  control  (C^)  decision  making. 

2.  Cognitive  Task  Requirements: 

Supporting  Decision  Processes  at  the  Senior  Command  Level 

2.1.  MODELING  COGNITIVE  TASK  REQUIREMENTS 
In  the  simplest  terms,  what  we  are  trying  to  do  with  technology  (the  task  require¬ 
ments)  should  define  what  we  want  technology  to  do  (the  interaction  design).  Andriole 
(1989)  emphasizes  the  importance  of  defining  requirements  for  decision  support  systems  in 
terms  of  the  users,  their  tasks,  and  the  organizational  and  doctrinal  context  in  which  deci¬ 
sions  are  made.  The  user  profile  helps  to  place  the  user  as  a  member  of  the  larger  organiza¬ 
tion;  the  task  and  environmental  analyses  identify  the  system’s  external  links  and  the  con¬ 
ditions  under  which  the  system  operates.  It  is  also  necessary  to  xinderstand  the  organiza- 
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tional  culture  and  how  it  directly,  or  indirectly,  impacts  and  is  impacted  by  the  user  and  the 
tasks.  Organizational  doctrine,  whether  implicitly  or  explicitly  communicated  to  the  user, 
may  provide  not  only  procedural  guidelines  for  structured  tasks,  but  also  a  conceptual  view 
and  global  goals  which  must  be  considered.  Finally,  in  the  course  of  evolutionary  design 
and  implementation,  the  designer  must  be  sensitive  to  the  re-definition  effects  of  the  new 
system  on  the  organization  and  its  doctrine. 

In  systems  designed  to  support  planning  and  decision  making,  it  is  important  to  iden¬ 
tify  the  requirements  associated  with  higher-order  cognitive  processes  involved  in  these 
tasks.  These  cognitive  task  requirements  (CTRs)  provide  a  focal  point  for  integrating  the 
user  profiles,  problem  solving  tasks,  and  the  decision  environments.  In  addition,  cognitive 
task  requirements  (CTRs)  strongly  impact  design  considerations  involving  interaction  con¬ 
trol  and  focus  of  attention.  The  system  designer  must  use  information  from  the  require¬ 
ments  analysis  to  minimize  system  interference  with  task  performance  and  direct  the  user’s 
attention  to  critical  information.  This  requires  understanding  which  cues  will  be  most  ef¬ 
fective  under  specific  conditions.  Particularly  in  cases  where  the  tasks  are  complex  and 
must  be  performed  in  a  dynamic,  time-stressed  environment,  the  designer  needs  such 
information  to  determine  optimal  display  formatting,  information  representation,  and  data 
presentation  rates. 

Currently,  there  are  no  prescriptive  means  of  matching  of  requirements  to  design  fea¬ 
tures.  Nevertheless,  efforts  in  cognitive  systems  engineering  research  have  yielded  some 
promising  approaches  to  help  bridge  the  requirements  to  design  gap.  For  example,  Andriole 
(1989)  employs  a  three-dimensional  matrix  to  represent  requirements  dimensions  and  sug¬ 
gest  a  mapping  to  the  UCI  design.  The  matrix  model  makes  use  of  a  variety  of  taxonomic 
approaches  to  the  classification  of  users  and  tasks  and  is  readily  extensible  to  a  wide  range 
of  system  contexts.  In  related  work,  Andriole,  Ehrhart  and  Aiken  (1991)  apply  this  method 
to  the  design  of  a  large-screen  display  for  theater-level  battle  management.  Their  research 
provides  a  hierarchical  listing  of  user,  task,  organizational-doctrinal  requirements  subse¬ 
quently  used  to  define  a  hierarchical  listing  of  user  interface  design  features.  For  the  pur¬ 
pose  of  this  study,  the  CTRs  were  captured  using  a  combination  of  Andriole’s  matrix  ap¬ 
proach  and  other  task  profiling  techniques  that  captured  the  domain-dependent  character¬ 
istics  of  theater  battle  management. 
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2.2.  COGNITIVE  TASK  REQUIREMENTS  FOR  THEATER  BATTLE  MANAGEMENT 
2.2.1.  The  User/Task/Organizational-Doctrinal  Requirements 

User  Profile.  Theater  commanders  are  called  into  situations  involving  either  joint 
(multi-service)  and/or  combined  (mixture  of  allies)  operations  in  both  limited  and  conven¬ 
tional  warfare.  Despite  extensive  domain  experience,  their  interaction  with  command  sup¬ 
port  systems  may  be  infrequent  and  varied  across  a  range  of  operational  experience.  For 
example,  a  given  command  support  system  used  in  one  theater  might  have  a  moderate  to 
high  frequency  of  use;  however,  normal  manpower  rotation  within  the  services  might  limit 
individual  hands-on  experience  with  the  system.  This  lack  of  uniform,  frequent  experience 
with  computer-based  decision  support  is  further  compounded  by  interface  designs  that  focus 
on  the  delivery  of  analytical  performance  at  the  expense  of  user  awareness  and  comprehen¬ 
sion.  The  result  is  what  Lt.  Gen.  Michael  Nelson  decries  as  the  technologist’s  tendency  to¬ 
ward  a  “blizzard  of  options  —  with  complex  selection  trees  —  and  not  towards  simplicity  at 
the  point  of  employment”  (Nelson,  1993). 

Task  Overview.  Military  decision  making  hinges  on  situation  assessment  and  option 
generation.  The  senior  command  team  begins  with  some  degree  of  uncertainty  and  seeks  to 
reduce  that  uncertainty  with  additional  information  and  evaluations.  Based  on  the  assess¬ 
ment  of  both  the  allied  and  enemy  situations,  alternative  courses  of  action  (COAs)  are  gen¬ 
erated  and  inferences  drawn  regarding  the  possible  outcomes  of  each.  Built-in  iteration 
loops  feed  demands  for  additional  information  and  refinement  of  the  assessment  and  option 
generation  phases.  An  essential  aspect  of  this  planning  process  is  the  development  of  a 
shared  situational  image.  The  commander’s  image  of  the  evolving  situation  and  battle  plan 
is  a  key  component  of  information  communicated  as  the  commander’s  “intent”.  In  pursuit 
of  this  shared  understanding,  planners  use  wall-size  maps  and  moveable  icons,  charts,  and 
3-D  representations  to  focus  their  discussions,  interacting  with  these  displays  and  each 
other  by  drawing  courses  of  action  or  moving  S3rmbols.  In  studies  of  US  Army  corps-level 
planning  groups,  Andriole  discovered  that  their  methods  were  highly  graphic  and  non¬ 
numeric  (Andriole,  1986).  In  addition,  planners  incorporate  analogs  from  previous  battles 
fought  in  the  same  or  similar  situations  and  tactics  employed  in  the  past.  Information  for 
situation  assessment  (i.e.,  weather  and  terrain  data,  bomb  damage  assessment,  textual 
evaluations,  photographs,  and  numerical  analyses)  are  provided  by  support  teams  in  docu¬ 
ments  which  may  be  so  large  as  to  preclude  rapid  review  under  pressure. 

Task  Environment.  The  theater-level  planning  environment  involves  decision  making 
on  a  crisis  and  non-crisis  (strategic  planning  and  exercise)  basis.  In  crisis  mode,  this  envi¬ 
ronment  is  characterized  by  very  high  stakes  and  the  requirement  for  a  short  decision  cycle. 
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Since  an  independently  operating  adversary  is  involved,  the  planners'  decision  cycle  must 
be  shorter  than  that  of  the  adversary  to  ensure  campaign  success.  Combined  operations 
present  new  and  unique  problems  in  command  and  control.  Relationships  with  allies  and 
host  nations  limit  the  strength  of  US  forces  through  political  agreements,  stationing  re¬ 
strictions,  force  ceiling  restrictions,  and  budgetary  constraints,  etc.  Decision  support  sys¬ 
tems  in  this  domain  must  assist  this  process,  promote  and  facilitate  group  interaction,  and 
aid  the  generation  and  evaluation  of  alternative  courses  of  action  (COAs). 

Doctrinal  and  Organizational  Considerations.  Any  profile  of  military  planning  re¬ 
quires  an  understanding  of  command,  both  the  theoretical  definitions  and  command  as  de¬ 
fined  by  the  current  doctrine  of  the  organizations  invoh'ed.  This  doctrine  constitutes  the 
organized  knowledge  of  strategy  and  tactics  employed  to  achieve  a  military  goal.  High-level 
goals  and  objectives  are  determined  by  orders  from  the  Commander-in-Chief  and  are  fur¬ 
ther  defined  within  the  context  of  guidelines  /  restrictions  from  Congress  and  the  allied  gov¬ 
ernments.  While  most  of  these  relationships  are  fixed  and  formal,  doctrine  is  a  dynamic 
and  evolving  theory  on  the  command  of  military  forces. 

Planning  and  decision  making  for  modem  warfare  require  the  integration  of  multiple 
(often  overlapping)  information  sources  developed  independently  with  varying  degrees  of 
relevance,  precision,  and  timeliness.  The  resolution  of  uncertainty  is  not  uniquely  a  func¬ 
tion  of  intelligence  fusion;  it  requires  a  combination  of  information  processing,  intelligence 
analysis,  and  the  collaboration  among  a  team  of  highly  experienced  senior  officers.  The  hi¬ 
erarchical  structure  of  military  organizations  plays  a  major  role  in  the  dynamics  of  the 
group  interactions.  Each  level  in  the  organization  senses  as  a  filter  to  synthesize,  summa¬ 
rize  and  present  information  upward  to  the  next  level.  Traditional  information  processing 
approaches  to  command  and  control  tend  to  focus  on  computational  tasks  without  integrat¬ 
ing  the  input  of  team  interaction  on  the  problem.  Naturally  enough,  these  systems  do  not 
provide  any  means  for  facilitating  team  dialogue.  This  segregation  tends  to  result  in  the 
relegation  of  computer-based  support  to  lower  echelons  or  limited  use  as  briefing  support. 

2.2.2.  CTR  Summary 

Based  upon  the  assessment  of  the  user,  task,  organizational-doctrinal  requirements,  it 
is  possible  to  summarize  the  cognitive  task  requirements  to  include  the  following: 

•  Support  for  infrequent,  non-uniform  system  experience  -  highly  transparent, 
“natural”  interaction,  permitting  users  to  literally  walk-up  and  use  the  system. 

•  Support  for  planning  and  decision  making  tasks  at  the  most  meaningful  level  of  ab¬ 
straction  -  the  right  information  at  the  right  time  in  the  right  form. 
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•  Support  for  sharing  images  —  facilitating  collaboration  on  situation  assessment, 
understanding  and  evaluating  options,  communicating  the  commander’s  intent  to 
operational  forces,  and  incorporating  feedback  on  progress  of  operations. 

•  Support  for  handling  information  overload  -  rehef  from  the  burden  of  manually 
reviewing  detail  without  sacrificing  awareness  of  critical  factors  and  developments; 

•  Support  for  mediating  the  potential  errors  associated  with  planning  and  team  de¬ 
cision  making  -  cues  to  the  accurate  interpretation  of  information  that  take  into 
account  the  degree  of  certainty,  precision,  and  completeness  associated  with  the  in¬ 
formation  source. 

•  Support  for  integrating  the  collaborative  nature  of  battle  planning  with  the  support 
system  interaction  requirements. 

The  next  section  examines  the  potential  application  of  information  presentation  and  inter¬ 
action  technologies  to  meet  these  high-level  requirements. 

3.  Technology  Resources  for  Information  Presentation  and  Interaction  in  the 

Theater  Command  Center 

Since  the  earliest  writings  on  command  in  warfare,  military  strategists  attempted  to 
characterize  the  commander’s  information  requirements  with  respect  to  dimensions  such 
as  timeliness,  degree  of  certainty,  level  of  detail,  etc.  One  of  the  most  promising  generic 
models  categorizes  information  in  terms  of  pipelines,  alarms  and  trees  (Kahan  et  al, 
1989) .  Pipelines  present  information  in  set  order,  set  format,  and  set  level  of  detail  as  a 
one-way  “push.”  This  mode  trades  off  lower  timeliness  (i.e.,  slower  delivery)  against  the 
value  of  a  lower  degree  of  uncertainty.  Thus,  its  principal  advantage  lies  in  its  uniformity 
and  consistency  as  a  baseline.  Since  the  information  variables  must  be  known  in  advance, 
this  mode  is  inadequate  when  the  information  needs  are  context-dependent  or  the  input 
variables  are  unknown.  At  the  opposite  end  of  the  spectrum,  alarms  (or  interrupts)  bal¬ 
ance  high  timeliness  and  detail  against  highly  uncertain  information.  Some  alarms  may  be 
set  in  advance;  others  must  be  tailored  to  suit  contingencies.  Highly  context-dependent  sit¬ 
uations  require  demand-driven,  branching  inquiry  -  or  trees.  Trees  involve  varying 
degrees  of  timeliness  and  detail,  but  are  usually  pulled  from  more  certain  sources.  Effective 
support  for  information  interaction  in  each  of  these  modes  requires  the  identification  of  UCI 
concepts  that  facilitate  more  direct  image  sharing  and  provide  the  capability  to  mix  supply - 
push  and  demand-pull  interaction. 


2-9 


3.2.  MULTIMEDIA  INFORMATION  PRESENTATION  RESOURCES 

A  comprehensive  body  of  research  in  memory  and  learning  supports  the  value  of  mul¬ 
tiple  communication  modes  and  presentation  media.  The  theoretical  literature  extended 
many  of  these  concepts  to  a  larger  model  of  human  information  processing.  From  the 
perspective  of  the  UCI  designer,  the  utility  of  multiple  information  presentation  media  may 
be  viewed  in  terms  of  an  expansion  of  the  information  communication  bandwidth  between 
the  computer-based  support  system  and  the  human  user.  At  each  level  in  the  command 
and  control  hierarchy,  command  decision  makers  develop  their  situational  images  from 
multiple  views  of  the  battlefield  (i.e.,  tabular  data,  mission  flow  charts,  maps,  sensor 
displays,  etc.).  In  addition  to  the  more  traditional  presentation  media,  there  is  a  growing 
interest  in  exploring  the  value  of  incorporating  high  resolution  visual  information,  sound, 
speech,  and  video.  Each  of  these  capabilities  support  the  senior  command’s  information 
access  requirements  (pipelines,  alarms,  trees)  in  varying  degrees.  Moreover,  information 
that  must  be  “pulled”  in  tree  inquiries  by  one  level  in  the  hierarchy  becomes  part  of  the 
information  that  is  “pushed”  to  the  next  level  in  the  form  of  briefings  or  situation  updates. 
For  example,  high  resolution  photographic  imagery,  a  critical  part  of  the  intelligence  input 
for  targeting  and  bomb  damage  assessment,  is  pulled  by  tasking  resources  to  collect  it  then 
pushed  to  the  next  decision  making  level  as  part  of  the  intelligence  assessment. 

In  addition  to  more  traditional  non-text  media  such  as  photographic  imagery,  the  ca¬ 
pability  currently  exists  to  communicate  information  to  the  senior  command  in  real  time  or 
from  recorded  sources  using  speech,  sound,  and  video  and  graphics.  Although  research 
tends  to  focus  on  speech  as  an  input  medium,  live  or  recorded  speech  may  provide  out  put 
as  verbal  communication  from  field  units  to  supplement  both  standard  (pipeline)  and  emer¬ 
gency  (alarm)  communication.  Speech  may  also  be  a  part  of  human  intelligence  informa¬ 
tion  (e.g.,  recorded  statements  from  adversaries)  or  supplementary  verbal  explanation  at¬ 
tached  to  graphic  and  photographic  data.  In  addition,  synthesized  speech  is  an  effective 
feedback  mechanism  for  certain  interaction  tasks  (e.g.,  notification  of  the  completion  of  a 
background  process).  Sound  is  often  associated  with  system  alarms;  moreover,  sound  in  the 
form  of  auditory  icons  (Gaver’s  “earcons”)  supports  the  perceptual  integration  of  multiple 
processes  in  the  form  of  a  single  audio  cue  (Gaver,  1991).  Similarly,  audio  feedback  incor¬ 
porated  into  visualization  routines  provides  additional  cues  to  assist  in  the  comprehension 
of  complex,  multidimensional  data. 

Video  combines  a  wide  range  of  still  and  motion  visual  imagery  with  speech  and  sound 
to  provide  multimodal  communication  of  complex  information.  Graphic  simulation  and 
multidimensional  information  visualization  project  dynamic,  evolving  representations  of 
highly  complex  fused  data  in  a  form  more  readily  comprehensible  to  human  decision 
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makers.  Information  presentation  techniques  such  as  stereoscopic  3-D  projection  and  holo¬ 
graphic  imagery  allow  users  to  “view”  the  battlefield  (including  aerospace  dimensions)  in 
ways  previously  limited  by  human  perceptual  capabilities.  Similarly,  the  multidimensional 
display  of  typically  non-visible  data  (e.g.,  sensor  data)  permits  users  to  explore  relation¬ 
ships  and  discover  meanings  in  highly  complex  situations. 

These  technologies,  employed  separately  and  combined,  provide  a  potential  enhance¬ 
ment  to  situational  awareness  in  the  command  center.  Although  electronic  transmission 
bandwidth  and  throughput  continues  to  present  problems  for  theater-wide  dissemination  of 
multimedia  information,  a  variety  of  sampling  and  compression  schemes  exist  which  con¬ 
siderably  reduce  the  bandwidth  requirements  associated  with  information-rich  media. 

Much  of  this  technology  exists  today  in  a  “fieldable”  form  suitable  for  use  in  the  theater  and 
force-level  command  centers. 

3.3.  REAL-TIME  INTERACTIVE  INFORMATION  RESOURCES 

The  various  multimedia  resources  also  support  two-way,  or  interactive,  information 
communication.  Speech  and  video  facilitates  distributed  team  decision  making  with  real¬ 
time  communication  between  and  among  the  upper  levels  of  joint  and/or  combined  forces. 
Interactive  simulation  allows  the  commander  to  explore  not  only  the  current  situation  and 
recent  history,  but  also  to  project  the  consequences  or  effects  of  the  various  courses  of  action 
available.  Animated  graphics,  for  example,  provide  the  dimension  of  time  to  unfolding 
events.  Multidimensional  visualization  permits  the  decision  maker  to  “tweak”  problem  pa¬ 
rameters  and  view  the  subsequent  effects  on  other  factors. 

Virtual  reality,  although  still  earliest  phases  of  development,  constitutes  the  most 
dramatic  advance  in  interacting  with  situational  information.  Current  technology  allows  a 
wide  range  of  interaction  levels  from  full  sensory  emersion  in  the  artificial  environment  to 
non-emersive  manipulation  of  virtual  objects.  Through  the  use  of  this  technology,  decision 
makers  may  examine  courses  of  action  without  risk  (i.e.,  route  planners  may  pre-fly  strike 
routes  through  threat  zones  in  unfamiliar  territory)  or  explore  situations  at  distant  and/or 
dangerous  locations  (e.g.,  commanders  may  “tour”  the  target  area  to  understand  the  contin¬ 
gent  targets).  Additional  possibilities  exist  for  allowing  humans  to  “visit”  or  interact  with 
environments  unavailable  to  human  beings  by  compressing  or  expanding  the  level  of  detail 
in  a  multidimensional  information  set  to  facilitate  the  decision  maker’s  understanding  of 
trends  or  patterns  in  data. 

Most  of  these  capabilities  exist  today  in  stsmdalone  prototype  form.  Research  to  eval¬ 
uate  their  use  and  impact  on  task  performance  typically  is  conducted  without  the  context  of 
a  larger  interaction  design  incorporating  various  information  presentation  media  and  inter- 
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action  modes.  There  is  a  critical  need  for  integrative  research  to  investigate  the  impacts  of 
combining  technologies  and  interaction  concepts  in  specific  task  environments. 

4.  The  Data  Wall: 

Integrating  Multimedia  Information  for  Theater  Battle  Management 

4.1.  THE  “DATA  WALL”  CONCEPT 

The  Advanced  Displays  and  Intelligent  Interfaces  Laboratory  (ADII)  in  the  C3  Divi¬ 
sion  at  Rome  Laboratory  is  currently  engineering  an  integrated  environment  to  explore  the 
design  possibilities  afforded  by  combining  multimedia  information  and  multimodal  UCI 
technologies  to  meet  the  requirements  of  theater  battle  management.  The  “Data  Wall”  is 
actually  a  seamless  wall-size  display  created  using  multiple  high-resolution  monitors  indi¬ 
vidually  driven  by  separate  processors  controlled  in  a  locally  distributed  architecture.  This 
design  permits  the  direct  display  (without  intermediate  frame  buffering)  of  a  single  image 
across  all  of  the  monitors  and  the  ability  to  open  multiple  windows  for  individual  or  com¬ 
bined  processes. 

The  Data  Wall  will  support  a  full  range  of  information  presentation  media  (e.g.,  pho¬ 
tographic  imagery,  sound,  speech,  video,  graphics,  alphanumeric,  etc.)  and  permit  the  com¬ 
mand  decision  team  to  interact  with  each  other  and  the  information  through  the  use  of  ges¬ 
ture  and  speech.  These  interaction  modes  all  the  more  natural  interaction  the  users  expe¬ 
rience  with  more  traditional  information  displays  such  as  wall-size  maps.  In  addition,  the 
transparent  interaction  (that  is,  without  requiring  an  understanding  of  system  interaction 
routines  or  the  use  of  keyboards,  etc.)  permits  the  first-time  user  “walk  up  and  use”  access 
to  the  system.  This  is  particularly  important,  since  the  Data  Wall  is  intended  to  be  a  the- 
ater-tailorable  system  and  may  vary  widely  in  input  and  output  configuration  based  upon 
the  resources  afforded  by  the  contingent  operational  environment.  The  Data  Wall  in  the 
ADII  Laboratory  will  permit  the  investigation  of  the  individual  and  combined  effects  of  var¬ 
ious  modes  of  interaction  and  information  presentation  within  the  context  of  team  decision 
making. 


4.2.  USER  INPUT;  ADDRESSING  THE  DATA  WALL 
The  introduction  of  computer-based  decision  aids  often  results  in  the  redirection  of 
much  of  the  effort  of  individual  team  members  from  interaction  with  each  other  and  the  de¬ 
cision  information  to  interaction  with  the  computer  support  tool.  One  of  the  goals  of  the 
Data  Wall’s  “transparent  interaction”  paradigm  is  to  permit  the  user  to  focus  attention  on 
the  tasks  and  the  team  rather  than  the  system.  This  is  accomplished  by  allowing  the  user 
to  address  the  Data  Wall  using  the  natural  communication  methods  of  speech  and  gesture 
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which  normally  accompany  interpersonal  interaction.  The  specific  means  and  devices  for 
supporting  multimodal  communication  in  a  team  context  introduces  one  of  the  pivotal 
research  and  design  issues  for  the  Data  Wall. 

The  most  readily  available  mechanism  for  gestural  interaction  is  the  direct  touch  (or 
touch  screen)  interface.  The  obvious  disadvantages  of  direct  touch  interaction  are  the  re¬ 
quirement  to  operate  “within  arms  reach”  and  the  potential  for  masking  portions  of  the 
display  during  input  with  the  hand  or  pointer.  The  former  may  be  addressed  with  the  use 
of  a  long  pointer;  however,  in  team  activities,  this  raises  the  additional  issues  of  who  will 
have  pointers.  Either  passing  a  single  pointer  among  participants  or  having  each  partici¬ 
pant  use  a  separate  pointer  seems  likely  to  interfere  with  both  the  team  interaction  and 
problem  solving  processes.  Most  of  the  non-touch  activated  pointing  devices  (i.e.,  light 
beam,  glove,  etc.)  carry  with  them  similar  limitations.  It  would  seem  desirable  to  retain  the 
naturalness  of  direct  touch  while  extending  the  “reach”  and  incorporating  more  of  the  ex¬ 
pressive  vocabulary  inherent  in  hand  motions.  Device-free  gestural  interaction  using  video 
camera  recognition  of  user  gestures  is  under  development;  however,  the  current  protot5rpes 
are  limited  to  single,  stationary  users  operating  in  extremely  close  proximity  to  the  camera. 

Gestural  interaction  would  appear  to  reduce  cognitive  workload  in  two  ways.  First,  it 
reduces  short  term  memory  requirements  in  that  input  and  output  are  accomplished  almost 
simultaneously  at  the  same  location.  Second,  it  reduces  the  requirement  for  recall  of  com¬ 
mands  or  interaction  sequences  from  long-term  memory  by  permitting  direct  interaction 
through  natural  gesture.  However,  gestural  interaction  shares  one  of  the  principal  limita¬ 
tions  of  the  most  mundane  direct  manipulation  interfaces:  in  order  to  act  on  an  object,  the 
object  must  be  visible.  For  example,  to  select  a  menu  item,  the  user  may  have  to  proceed 
sequentially  through  a  series  of  menus  or  traverse  a  layered  hierarchy  of  menu  options. 

The  direct  access  afforded  by  command  interaction  is  not  available,  except  as  explicitly  de¬ 
signed  into  iconic  functions.  None  of  the  purely  gestural  interaction  paradigms  surmounts 
this  issue. 

The  addition  of  speech  to  gestural  interaction  permits  a  more  robust  specification  of 
the  object(s)  of  interest  and  action(s)  required.  Speech-based  interaction  permits  the  com¬ 
mander  or  staff  officer  to  address  the  Data  Wall  as  they  would  another  team  member.  The 
decision  maker  can  ask  directly  for  information  (e.g.,  “Where  are  the  active  SAM  sites?”) 
and  probe  for  explanations  of  computer-generated  options.  Currently,  practical  application 
of  speech  understanding  is  limited  to  non-continuous  (or  discrete)  speech.  The  command 
center  tasks,  organization  and  environment  mandate  input  modes  that  will  accommodate 
multiple  users  working  together  and  independently.  The  users’  tasks,  situational  environ- 
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ment  and  organizational  structure  significantly  impacts  the  determination  of  an  appropri¬ 
ate  mode  of  communication  and  interaction.  Further  research  is  required  in  a  variety  of  in¬ 
put  methods  to  ascertain  their  suitability  to  tasks  and  user  preference  under  task  condi¬ 
tions. 


4.3.  SYSTEM  OUTPUT;  INFORMING  THE  USER 

The  Data  Wall  concept  involves  the  capability  to  present  and  interact  with  informa¬ 
tion  through  multiple  windows.  This  feature  will  allow  the  command  team  to  work  individ¬ 
ually  and  together  with  reference  to  task  specific  information  while  maintaining  a  situa¬ 
tional  overview  through  maps  and  detail  displays.  There  are  a  variety  of  configuration 
options  for  windows  and  multiple  displays.  Multiple  windows  may  be  used  to  display  tem¬ 
poral  or  spatial  ordering  and  provide  comparisons  to  system  generated  models.  Users  may 
also  zoom  windows  in  and  out  to  facilitate  examination  of  detail.  The  use  of  multiple,  S5m- 
chronous  windows  can  also  serve  to  provide  insights  to  the  effects  on  one  model  based  on 
changes  in  another.  Intelligent  windows  are  a  powerful  way  to  organize  the  display  of  mul¬ 
tiple  processes.  The  development  of  techniques  to  relate  windows  and  create  dependencies 
within  contexts  may  result  in  a  dramatic  change  in  the  way  users  interact  with  computers. 
In  the  1970’s  the  Architecture  and  Machine  Group  Laboratory  (now  known  as  the  Media 
Lab)  at  MIT  developed  an  entirely  new  dimension  in  user  interface  built  upon  concepts  of 
spatial  articulation  of  data.  The  user  interacts  with  a  wall-size  display  incorporating  tradi¬ 
tional  alphanumeric  displays,  still  frames,  video,  and  computer  animation  in  window-like 
formats.  A  variety  of  interaction  devices  allow  the  user  to  'turn'  pages  of  text,  zoom  in  on 
information,  and  access  information  within  the  data  base.  Current  research  is  focusing  on 
new  technology  which  will  allow  the  user  to  interact  with  the  system  through  speech  ges¬ 
tures,  eye  movements,  voice  commands,  and  similar  non-mechanical  means  to  achieve  a 
higher  degree  of  interface  transparency. 

In  a  literal  interpretation  of  human-computer  communication  or  human-computer  in¬ 
teraction,  Bpdker  (1991)  suggests  that  designers  consider  human-computer  communication 
in  terms  of  interpersonal  communication.  This  approach  conceptualizes  the  computer  as  an 
active  partner  in  task  performance  and  is  reflected  in  research  efforts  to  develop  associate 
technologies  (e.g..  Pilot’s  Associate)  and  intelligent  agents  that  track  activities  and  perform 
tasks  individualized  to  the  context  and  the  operator  they  “represent.”  The  intelligent  multi¬ 
agent  model  bears  some  resemblance  to  the  blackboard  paradigm  in  which  multiple  pro¬ 
cesses  contribute  problem  solving  input  from  their  assigned  areas.  The  use  of  agents  to 
support  decision  tasks  will  most  likely  require  the  means  (similar  to  the  blackboard  man¬ 
ager)  to  assess  the  various  contributions  from  multiple  agents.  In  lieu  of  such  management 


2-14 


paradigms,  the  user  is  once  again  faced  with  an  onslaught  of  information  from  the  various 
agents  competing  for  attention. 

Clearly,  this  paradigm  is  not  appropriate  for  all  applications.  Furthermore,  concep¬ 
tualization  of  the  computer  as  a  human-like  partner  may  actually  limit  the  full  realization 
of  the  technological  advantages.  However,  the  intelligent  agent  need  not  take  on  exces¬ 
sively  human  characteristics.  The  principal  goal  of  intelligent  associate  or  agent  paradigms 
is  the  delegation  of  certain  tedious,  repetitive  tasks  to  the  system  in  order  to  free  the  user  to 
pursue  higher-order  cognitive  activities.  The  potential  for  their  use  in  a  team  decision  envi¬ 
ronment  depends  upon  a  better  understanding  of  the  role(s)  agents  may  play,  how  they  will 
be  linked  to  individual  users  or  user  tasks,  and  how  multiple  agents  may  communicate  with 
the  decision  making  team  without  creating  chaos  in  the  command  center. 

5.  UCI  Research  Directions:  Exploring  the  Potential  Impacts  of  Data  Wall 
Technologies  on  the  Command  Process 

5.1.  ENHANCING  DECISION  PROCESSES 

In  the  rush  to  bring  new  support  technology  to  the  users,  system  designers  should 
take  care  to  retain  and  extend  technological  support  to  those  aspects  of  the  decision  pro¬ 

cess  that  work  well  today.  The  UCI  design  for  the  Data  Wall  must  facilitate  the  team  in¬ 
teraction  critical  to  successful  campaign  planning.  For  example,  while  workstation-based 
high-definition  maps  are  useful  for  analysts,  the  command  team  use  the  wall  map  as  a 
frame  of  reference  and  communication  tool  in  group  discussions.  The  Data  Wall  enhances 
interaction  with  the  static  map  to  permit  dynamic  mapping  capabilities  while  retaining  the 
discussion  and  gestural  modes  of  communication.  Furthermore,  the  Data  Wall  enhances  in¬ 
formation  presented  through  the  map  by  using  the  map  as  a  spatial  reference  and  access 
point  to  situational  detail  in  the  form  of  simulation,  imagery,  fused  intelligence,  etc.  Real¬ 
time  video  and/or  audio  further  enhances  distributed  team  interaction  by  allowing  the 
senior  command  face-to-face  communication  with  in-theater  command  posts. 

In  addition  to  enhancing  current  capabilities,  the  Data  Wall  has  strong  potential  to 
address  some  of  the  current  problems  in  information  presentation  for  command  decision 
making.  The  multimodal  interaction  and  multimedia  information  presentation  architecture 
of  the  Data  Wall  design  incorporates  the  layered  information  access  concept  endorsed  in  the 
C^I  for  the  Warrior  objectives.  Windowing  against  a  geographic  reference  permits  the 
decision  maker  to  control  the  level  of  detail  desired  without  sacrificing  awareness  regarding 
the  situational  context.  Thus,  alarms  and  updates  relevant  to  background  processes  can  be 
presented  on  the  large  map  to  reference  context. 
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Research  indicates  that  overloading  the  human  decision  maker  with  information  re¬ 
sults  in  decisions  based  on  the  evaluation  of  only  a  portion  of  the  available  information. 
Filtering  (screening  out  everything  except  what  is  relevant  to  current  decision)  and  fusion 
(aggregating  data  to  appropriate  level  of  abstraction  for  problem)  may  result  in  several 
sources  for  decision  error.  For  example,  pieces  that  don’t  fit  tend  to  tending  to  get  reshaped 
or  dropped.  Inconsistencies  disappear  and  quantitative  discrepancies  get  averaged  out. 
Information  displays  may  suppress  or  mask  uncertainty  resulting  from  such  factors  as  sen¬ 
sor  range/resolution  limitations,  environmental  constraints,  human  assumptions  and  bias, 
imprecise  or  ambiguous  language,  interference,  enemy  deception  and  countermeasures. 

The  Data  Wall’s  application  of  intelligent  agents  provides  the  means  to  both  buffer  the  user 
from  detail  and  maintain  a  less  biased  appraisal  of  situational  data.  For  example  ,  an  agent 
which  has  been  “trained”  to  consider  the  impacts  of  imprecision  or  completeness  in  evaluat¬ 
ing  data  will  “remember”  to  annotate  the  presentation  of  situational  displays  with  qualify¬ 
ing  information  for  evaluation  by  the  decision  maker.  Furthermore,  the  agent  presents  ad¬ 
vice  unaffected  by  the  impacts  of  time  pressure  or  perceptual  bias  within  the  constraints  of 
its  programming. 


5.2.  SPAWNING  NEW  COGNITIVE  TASK  REQUIREMENTS 

The  introduction  of  multimedia  information  presentation  and  multimodal  interaction 
into  the  current  command  and  control  decision  environment  will  effect  the  team  decision 
making  process.  Design  and  development  of  the  Data  Wall  system  concept  must  incorpo¬ 
rate  the  assessment  of  new  cognitive  requirements  potentially  imposed  by  the  various  ad¬ 
vanced  technologies  and  interaction  designs  considered.  For  example,  the  simple  use  of 
multiple  windows  central  to  the  Data  Wall  concept  has  several  implications  for  individual 
and  team  use  in  the  command  center  environment.  Improperly  designed,  windowing  and 
hierarchical  menu  structures  can  result  in  the  user  “getting  lost”  in  a  multiple  display  sys¬ 
tem  (Woods,  1984).  Woods  has  coined  the  term,  visual  momentum,  to  describe  the  measure 
of  the  user’s  ability  to  draw  out  and  synthesize  information  across  a  series  of  displays.  Low 
visual  momentum  results  in  a  decrease  in  the  field  of  attention  or  cognitive  ‘tunnel  vision 
resulting  the  failure  to  process  important  information  and  a  potential  for  getting  lost  in  the 
network  of  displays,  menus,  and  windows.  These  problems  are  linked  to  a  farther  increase 
in  mental  workload  and  degraded  problem  solving  performance. 

There  are  several  techniques  for  improving  visual  momentum.  The  concept  of  an 
‘establishing  shot’  or  ‘long  shot’  from  motion  visual  communication  can  be  applied  in  the 
form  of  process  or  system  overviews.  Successive  displays  can  also  be  linked  perceptually 
through  the  use  of  fixed  screen  formats,  perceptual  landmarks,  and  display  overlays.  Over- 
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lays,  commonly  used  in  map  displays  permit  the  introduction  of  new  information  without 
changing  the  basic  display.  Spatial  formats  can  provide  information  similar  to  the  long 
shot  by  displaying  temporal  or  hierarchical  relationships  in  data  elements.  Moreover,  spa¬ 
tial  displays  have  analogical  aspects  which  can  significantly  reduce  the  mental  workload  of 
the  user. 

In  addition  to  the  cognitive  issues  associated  with  the  interaction  concept,  the  Data 
Wall  design  must  address  the  issues  of  communication  and  interaction  control  arising  from 
having  multiple  users  at  a  common  display.  The  team  members  must  have  a  clearly  de¬ 
fined  and  understood  ability  to  control  and  direct  the  activities  of  the  Data  Wall.  Further¬ 
more,  output  directed  toward  the  group  must  be  discriminated  from  process  output  from 
team  members’  individual  inquiries.  These  requirements  imply  an  intelligent  interaction 
design  that  is  “aware”  of  the  individual  users.  Finally,  the  Data  Wall  design  must  also 
consider  the  learning  and  training  requirements  for  system  use  --  including  initial  learning, 
retention,  and  extension  of  skills.  Each  is  impacted  by  the  nature  of  the  users,  the  tasks 
performed,  and  the  frequency  and  circumstances  of  use.  Additional  tradeoffs  exist  between 
ease  of  learning  and  usability  versus  problem  solving  power  and  task  utility.  Research  with 
operationally  qualified  users  performing  representative  tasks  wdl  permit  exploration  to 
determine  the  kinds  of  learning  and  performance  aids  required. 

5.3  EXPLORING  &  EVALUATING  NEW  INTERACTION  C  ONCEPTS 

The  Data  Wall  is  a  high-risk,  investment-intensive  concept.  Multimedia/multimodal 
interfaces  integrate  input  modes  and  information  representations  which  were  previously 
unavailable  to  users.  Design  decisions  regarding  the  Data  Wall’s  information  presentation 
and  interaction  technologies  must  be  based  on  their  potential  contribution  to  improved 
communication  with  the  user,  rather  than  their  novelty  appeal.  The  effectiveness  of  tech¬ 
nology  in  an  organization  is  often  limited  by  a  tendency  to  plan  new  systems  solely  in  terms 
of  existing  systems  or  manual  procedures.  At  best,  this  results  in  incremental  changes, 
often  only  “patching”  the  old  systems  rather  than  any  true  innovation  in  technological  sup¬ 
port.  To  fully  exploit  the  capabilities  of  new  technology,  an  effective  systems  planning 
methodology  must  overcome  these  more  limited  visions.  For  example,  Curtice’s  (1987) 
Strategic  Value  Analysis  (SVA),  focuses  on  the  organization’s  fimctional  requirements  and 
the  technological  opportunities  for  supporting  those  requirements,  rather  than  narrowly 
viewing  new  systems  in  terms  of  problems  vdth  existing  systems.  The  design  and  develop¬ 
ment  process  should  incorporate  guidance  from  technological  forecasting,  theoretical  htera- 
ture,  and  context- specific  research  findings. 
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As  presented  in  this  paper,  the  Data  Wall’s  multimedia/multimodal  architecture  de¬ 
pends  upon  several  pivotal  technological  issues.  In  addition  to  the  computational  architec¬ 
ture  requirements,  the  Data  Wall  designers  must  explore  the  requirements  and 
implications  of  the  following: 

•  Multimodal  Input  -  Identification  and  evaluation  of  speech  and  gestural  input 
modes  with  reference  to  the  team  environment  and  decision  making  tasks  in 
dynamic,  complex  environments. 

•  Multimedia  Output  -  Identification  and  evaluation  of  “user-aware”  modes  of 
providing  feedback  on  processes  and  presenting  user-controlled  information. 

•  Intelligent  Interaction  -  Preserving  user  control  in  a  team  environment,  while 
providing  support  for  robust  situational  awareness. 

Increasing  the  synergy  between  technology  development  and  organizational  task 
requirements  is  key  to  leveraging  advanced  interface  technology  against  the  changing 
requirements  in  command  and  control.  The  successful  development  of  advanced  technologi¬ 
cal  concepts  such  as  the  Data  Wall  depend  upon  a  comprehensive,  requirements-driven 
design  approach.  Guidelines,  standards  and  research  on  individual  components  are  insuf¬ 
ficient  for  answering  the  key  questions.  The  successful  development  of  the  next  generation 
of  command  and  control  support  systems  will  require  a  better  understanding  of  the  inter¬ 
actions  and  interdependencies  between  technologies,  users,  organizations,  and  tasks.  This 
understanding  must  be  based  in  part  on  findings  from  context-specific  research.  The  ADII 
Data  Wall  Laboratory  affords  the  exploration  of  these  issues  by  facilitating  context-specific 
research  on  the  application  of  a  wide  range  of  advanced  UCI  technologies  and  interaction 
concepts  to  a  variety  of  military  and  non-military  operational  contexts. 
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ABSTRACT 

Traditional  tactical  and  theater  military  communication 
networks  are  characterized  by  relatively  low  bandwidth  links.  The 
environment  is  dynamic  in  the  sense  that  the  links  are  subject  to 
jamming  and  the  nodes  to  destruction  by  the  enemy.  Modern  and 
future  military  equipment  and  tactics  require  the  use  of  wideband 
links  to  exchange  bandwidth-intensive  information  such  as  video  and 
images.  However,  current  and  proposed  wideband  networks  such  as 
ATM  have  been  designed  for  peacetime,  i.e.  well-behaved  operation. 
The  research  described  herein  proposes  and  develops  an  original 
Fast  ATM  Routing  (FAR)  protocol  which  adapts  wideband  ATM  networks 
for  operation  in  the  dynamic  theater  environment.  It  is  shown  that 
the  FAR  protocol  provides  robust  performance  in  such  an 
environment. 
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A  FAST  ATM  ROUTING  ALGORITHM 
FOR  THE  DYNAMIC  THEATER  ENVIRONMENT 

Robert  R.  Henry  and  Claud  K.  Jones 

INTRODUCTION 


The  research  performed  this  summer  by  the  authors  consists  of 
the  development  and  evaluation  of  a  protocol  to  suitably  adapt  the 
Asynchronous  Transfer  Mode  (ATM)  implementation  of  integrated 
broadband  networking  to  the  requirements  of  a  Theater  Environment 
Network  (TENet) .  This  research  area  was  previously  studied  [1] 
to  identify  ways  in  which  the  advantages  of  highly  adaptable 
dynamic  routing  protocols  could  be  combined  with  the  broadband 
capacity  of  ATM.  This  report  is  divided  into  four  parts.  The 
first  describes  the  relationship  between  this  research  effort  and 
other  research  efforts  now  being  conducted  or  recently  completed  at 
Rome  Labs.  The  second  describes  the  protocol  developed  by  the 
authors  this  summer.  The  third  presents  the  results  of  a 
deterministic  performance  analysis  of  the  protocol,  and  the  fourth 
describes  and  presents  the  results  of  a  detailed  simulation  of  the 
protocol  using  the  OPNET  simulator. 


RELATIONSHIP  TO  CURRENT  PROJECTS 


Current  Projects 

The  research  performed  by  the  authors  this  summer  was  chosen 
to  complement  and  support  current  research  efforts  undertaken  by 
Rome  Labs.  One  such  project  is  the  Evaluation  and  Development  of 
Multimedia  Networks  Under  Dynamic  Stress  (EDMUNDS)  project  [2]. 
It  is  an  ongoing  effort  to  develop  Secure  Tactical  Internet 
Protocols  (STIP)  that  respond  well  to  a  stressed  tactical 
environment.  Such  an  environment  is  characterized  by  communication 
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links  subject  to  jamming  and  nodes  subject  to  destruction  by  the 
enemy.  The  assumption  is  that  the  links  have  relatively  low 
bandwidth,  and  that  there  is  sufficient  time  to  perform 
sophisticated  processing  to  determine  optimal  datagram  routing. 

The  Secure  Survivable  Communication  Network  (SSCN)  project 
[3]  was  initiated  in  an  effort  to  utilize  the  emerging  Broadband 
Integrated  Services  Digital  Network  (B-ISDN)  for  military 
applications.  This  network  relies  on  the  Asynchronous  Transfer 
Mode  (ATM)  to  provide  rapid  multiplexing  and  routing  of  data  over 
wide  bandwidth  links.  Due  to  high  data  rates  there  is  relatively 
little  time  to  determine  the  optimal  route  of  each  packet  (cell) . 
Therefore  connection  oriented  services  have  been  selected  to 
provide  routing  that  is  fixed  for  the  call  duration.  This  is  in 
direct  contrast  with  the  STIP  routing  protocol. 

Two  other  projects,  the  Media  Resource  Controller  (MRC)  [4] 
and  the  Multimedia  Communication  Capability  (M2C2)  [5],  are 

similar  to  the  EDMUNDS  project  in  the  sense  that  the  routing  is 
adaptive  and  the  links  are  low  bandwidth.  However,  these  projects 
have  actually  been  implemented  and  demonstrated  in  the  laboratory. 
Another  laboratory  oriented  project  is  the  Advanced  Multi-Media 
Information  Distribution  System  (AMIDS)  [6]  in  which  a 
distributed  Tactical  Air  Control  Center  (TACC)  is  interconnected  by 
fixed  wideband  links. 

ATM  and  AMIDS  have  high  capacity  links  in  contrast  with  the 
lower  capacity  MRC  and  STIP  links.  Each  may  be  classified  into  one 
of  two  groups  based  on  link  capacity  and  the  ability  to  adapt  to  a 
stressed  environment  as  follows: 

•  ATM/AMIDS:  high  capacity,  low  adaptability 

•  STIP/MRC/M2C2 :  low  capacity,  high  adaptability. 

Interconnection  between  these  two  dissimilar  groups  requires 
careful  consideration  so  that  performance  does  not  revert  to  the 
least  common  denominator,  i.e.  low  capacity  and  low  adaptability. 
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Figure  1  illustrates  the  networks  in  these  two  groups  in  more 
detail.  As  can  be  seen  ATM  represents  the  highest  in  link 
capacity,  while  STIP  represents  a  high  degree  of  adaptability  to  a 
stressed  environment. 


This  Research 

An  underlying  concept  of  the  global  grid  concept  is  that  the 
National  Command  Authority  (NCA)  will  be  able  to  communicate 
directly  with  personnel  in  the  tactical  environment.  In  order  that 
graphics  and  video  information  be  delivered  in  a  timely  manner, 
high  capacity  links  are  essential.  ATM  is  a  candidate  for  the 
wide-area  part  of  this  network  since  it  is  wideband,  is  being 


3-5 


developed  commercially,  and  is  being  standardized  in  international 
forums.  However,  since  ATM  is  designed  for  peacetime  (well- 
behaved)  use,  adaptive  protocols,  such  as  those  incorporated  into 
STIP,  must  also  be  incorporated  into  ATM-based  networks  to  provide 
for  survivability  in  the  tactical  area.  This  report  documents 
research  by  the  authors  that  formulates  a  protocol  which  combines 
the  advantages  of  both  the  ATM  and  adaptive  (i.e.  STIP-like) 
protocols  into  a  network  suitable  for  the  military  environment. 
This  is  the  "RESEARCH"  area  indicated  in  Figure  1. 


THE  FAST  ATM  REROUTING  PROTOCOL 

Commercial  broadband  networks  are  expected  to  use  very 
reliable,  extremely  low  noise  optical  fiber  links.  Given  this 
assumption,  error  recovery  may  feasibly  be  removed  from  the 
internal  operation  of  the  network  to  the  network  edges.  Military 
networks  operating  in  a  theater  environment  cannot  make  this 
assumption.  The  US  military  may  be  called  upon  to  operate  in  areas 
far  removed  from  modern  networks  and  so  will  continue  to  rely  upon 
links,  characterized  by  high  bit  error  rates,  for  its  communication 
requirements.  Military  communication  networks  are  subject  as  well 
to  the  fluid  nature  of  combat  as  exemplified  by  frequent  network 
reconfiguration  and  hostile  action. 

The  thrust  of  this  research  is  to  find  ways  in  which  broadband 
networks  based  on  ATM  may  be  adapted  to  the  dynamic  theater 
environment.  The  authors  assert  that  fault  recovery  should  not  be 
removed  to  the  network  edges  in  the  theater  environment  but  must 
remain  internal  to  the  network.  It  will  be  shown  that  the  Fast  ATM 
Rerouting  (FAR)  protocol  proposed  by  the  authors  reduces  the 
disruption  to  the  network  users  due  to  failure.  It  does  so  by 
automatically  rerouting  to  an  alternate  virtual  path  connection 
(VPC)  route  in  a  way  that  is  functionally  compliant  with  ATM 
standards . 
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ALGORITHM  DESCRIPTION 


The  purpose  of  FAR  is  to  overcome  some  of  the  problems 
associated  with  high  speed  networks  when  disruptions  in  the  network 
occur.  Unlike  today's  relatively  low  speed  networks,  propagation 
delay  dominates  in  broadband  networks.  This  latency  means  that  a 
very  large  amount  of  data  (on  the  order  of  10“^  bits  per  1000  km,  or 
about  1500  ATM  cells)  may  be  in  transit  from  source  to  destination. 
In-theater  networks  must  be  prepared  for  frequent  disruptions,  or 
rapid  and  unpredictable  changes  in  their  state,  resulting  in  large 
losses  of  data.  FAR  is  a  step  towards  minimizing  the  potential 
loss  of  data  by  allowing  the  network  to  sense  and  react  as  quickly 
as  possible  to  these  disruptions.  This  paradigm  places  in  each 
node,  or  strategically  selected  nodes,  the  ability  to  assess  the 
quality  of  its  communication  with  its  neighbors  and  to 
automatically  reconfigure  appropriate  VPCs. 


The  FAR  Paradigm 

When  a  node  determines  that  its  ability  to  communicate  with  a 
neighbor,  i.e.  the  quality  of  service  (QOS)  ,  has  degraded  below 
some  threshold  (a  "fault"  occurs) ,  it  will  take  prompt  action  to 
activate  an  alternate  path.  The  QOS  could  involve  parameters  such 
as  bandwidth,  latency,  security,  etc. 

The  method  proposed  here  maintains  alternate  standby  paths 
(either  hot  standby  or  dormant)  that  can  be  activated  and  used  very 
quickly  when  a  fault  is  observed.  Figure  2  illustrates  one  such 
primary  VPC  with  one,  pre-defined  alternate  path  branching  from 
node  I.  Other  alternate  paths  may  exist  in  addition  to  this  one 
but  are  not  shown  in  order  to  simplify  the  diagram.  This 
representative  VPC  is  also  shown  in  isolation  from  other  VPCs 
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Figure  2.  Primary  VPC  and  alternate  for  node  I. 


traversing  the  same  nodes.  It  is  defined  by  source  node  S, 
destination  node  D,  intermediate  node  I  ,  n,,  upstream  (relative  to 
node  I)  intermediate  nodes  IR,  n^  downstream  intermediate  nodes  IF, 
and  n^  alternate  path  intermediate  nodes  lA.  It  is  also  assumed 
that  reverse  VPCs  exist  from  node  D  to  node  S  along  the  primary  and 
alternate  paths. 

The  typical  intermediate  node  I  has  detected  a  fault 
immediately  downstream  of  itself.  In  response,  node  I  takes  the 
necessary  steps  to  reestablish  the  VPC  to  D  along  the  n^  alternate 
set  of  nodes  lA.  The  new  route  continues  to  use  the  n^  IR  nodes 
upstream  from  the  failure,  but  bypasses  the  n^  downstream  forward 
path  nodes  IF  downstream  of  the  failure. 

In  addition  to  the  rerouting  mechanism  to  be  described  in  this 
paper,  it  is  assumed  that  there  are  two  other  processes  running 
asynchronously  in  the  background  at  each  node.  One  is  the  fault 
detection/reporting  function  mentioned  in  the  previous  paragraph. 
The  second  provides  the  alternate  route  to  be  used  when  a  link 
failure  is  detected.  Such  routes  may  be  updated  relatively  slowly 
compared  to  the  time  in  which  the  node  is  required  to  switch  to  the 
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alternate  route.  The  selection  of  the  alternate  routes  is  an  area 
for  future  study  and  is  beyond  the  scope  of  this  paper. 

An  advantage  of  FAR  is  that  most  resources  are  consumed  only 
when  needed,  with  VPI  resources  being  consumed  to  maintain 
alternate  paths.  Bandwidth  remains  available  to  other,  active 
VPCs.  FAR  is  also  distributed  with  each  node  taking  action  based 
on  0AM  (Operation  and  Maintenance)  cells  exchanged  with  its 
neighbors  allowing  it  to  recover  from  a  fault  in  any  link  or  node. 
A  two-way  handshake  is  provided  from  the  intermediate  node  I  to  D 
and  back.  However,  node  I  may  opt  for  an  early  reroute  immediately 
upon  detection  of  the  fault  before  confirmation  is  received  from 
the  handshake  as  to  the  availability  of  resources  on  the  alternate 
path. 

Assumptions  for  Protocol  Implementation 

The  goal  of  the  research  performed  in  the  summer  of  1993  at 
Rome  Labs  was  to  fully  implement  and  test  a  portion  of  the  FAR 
protocol.  The  limitation  of  time  dictated  that  we  adopt 
simplifying  assumptions  to  enable  us  to  reach  this  goal.  First 
only  one  alternate  path  branching  at  the  node  just  upstream  of  the 
fault  is  assumed.  A  hard,  yes/no  decision  is  made  on  QOS 
parameters,  ie  there  is  an  abrupt  transition  from  acceptable  to 
non-acceptable  QOS.  Only  one  try  is  made  at  rerouting  with  the 
source  being  notified  of  the  attempt  and  the  status  of  the  attempt. 
No  timeouts  are  provided  for  lost  0AM  cells.  A  two-way  handshake 
is  used  from  the  node  initiating  recovery  to  the  destination  node. 
No  handshake  is  used  upstream  from  the  initiating  node  but 
notification  of  action  taken  is  sent.  We  stress  that  these 
assumptions  are  not  inherent  to  the  FAR  protocol  but  were  made  only 
in  the  interest  of  reducing  the  time  to  achieve  measurable  results 
from  the  protocol. 
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QAM  Cells 


Node  I  could  assess  a  fault  through  the  use  of  FI,  F2  and  F3 
(0AM)  cells  as  defined  by  the  Consultative  Committee  on  Telephony 
and  Telegraphy  (CCITT)  [7].  Further  definition  by  the  CCITT  and 
other  standards  bodies  is  required  before  the  suitability  of  using 
these  0AM  cells  can  be  ascertained.  Therefore,  we  have  formulated 
generic  type  0,  1,  2  and  3  cells  that  perform  specific  functions 
required  of  the  FAR.  Type  0  cells  detect  and  report  fault 
conditions  between  ATM  devices.  Cell  types  1,  2  and  3  are  used  to 
recover  from  the  fault  condition.  The  functions  of  each  cell  type 
are  described  below: 

•  Type  0  Detects  faults  and  reports  on  the  status  of  the  VPC 

between  nodes  I  and  IF. 

•  Type  1  Sent  to  node  lA  along  the  alternate  route 

(downstream  of  I  and  bypassing  the  fault)  and 
forwarded  until  D  is  reached  or  the  alternate  path 
also  fails.  It  changes  the  alternate  path  to 
primary  route  status  for  the  VPC  in  question. 

•  Type  2  Sent  to  node  S  to  inform  it  of  the  VPC  status, 

"rerouting",  "rerouted",  or  "alt_path_fail" . 

•  Type  3  Sent  to  node  I  by  node  D  upon  receipt  of  a  type  1 

cell  or  by  one  of  the  intermediate  alternate  path 

nodes  lA  if  the  alternate  path  can  not  be  activated 
for  some  reason.  It  informs  I  if  the  alternate 
path  was  successfully  activated. 
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Protocol  Descriotion 


The  FAR  protocol  can  be 
viewed  as  a  Finite  State  Machine 
(FSM)  residing  at  or  contained 
within  an  ATM  node.  Seven  high 
level  states  are  presently 
specified  as  shown  in  Figure  3. 
Lower  level  states  may  be 
contained  within  each  high  level 
state. 


The  loop-back  state  Figure  3 .  Current  FSM  for  the  FAR 
consists  of  the  exchange  of  type  Protocol. 

0  cells  with  each  neighboring 

node.  Faults  are  indicated  by  status  conditions  returned  from 
downstream  or  by  a  timeout.  The  exact  operation  in  this  state  is 
not  yet  defined  pending  further  definition  of  standards.  Only 
faults  immediately  downstream  of  the  subject  node  are  currently 
detected. 


LEGEND; 

3  Port 
C533  VP  I 

- ^  Forward  VPC 

-  Reverse  VPC 

[ 0AM  Cell  of  Type  « 


ALTERNATE  PATH 


Figure  4.  VPC  schematic  and  0AM  cell  activity. 
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Figure  4  illustrates  the  VPCs  in  the  vicinity  of  node  I  and 
the  0AM  cell  traffic  generated  as  a  result  of  the  fault.  The 
outgoing  port  and  VPI  to  route  cells  along  the  alternate  path  are 
contained  in  the  alternate  path  output  port/VPI  table  entry  for 
that  VPC  as  shown  in  Figure  5.  In  the  fault  recovery  state  the  FSM 
first  checks  to  see  if  adequate  resources,  (ie  bandwidth)  is 
available  to  begin  the  activation  of  the  alternate  path.  If  so,  a 
type  1  cell  is  sent  to  the  next  downstream  node  lA,  which  then 
forwards  the  cell  to  the  next  node  lA  and  so  on  until  node  D  is 
reached.  A  type  2  cell  is  sent  directly  to  the  source  node  S 
indicating  that  the  network  is  attempting  to  reroute  the  VPC. 
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port/VPI  entry  for  a  VPC 
forward  VPC  index 
reverse  VPC  index 

Bandwidth  parameters  set  for  the  current  VPC 
Bandwidth  parameters  acceptable  if  alternate 
path  is  activated 

Address  of  node  connected  to  source 
Address  of  node  connected  to  destination 


Figure  5.  Expanded  VPC  table. 


If  the  resources  are  not  available,  a  type  2  cell  only  is 
sent,  again  to  node  S,  indicating  the  failure  of  both  the  primary 
and  alternate  paths.  In  the  representative  situation  shown  in 
Figure  2,  node  I  has  entered  the  fault  recovery  state,  transmitted 
a  type  1  and  type  2  cell  and  has  returned  to  the  monitor  state.  At 
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a  later  time  a  type  3  cell  will  return  to  node  I  from  node  D  or  one 
of  the  nodes  lA  causing  it  to  enter  the  alternate  path  indication 
state.  Node  S  enters  the  alternate  path  status  indication  state  on 
receiving  the  type  2  cell.  The  actions  that  may  be  taken  by  the 
source  could  include  temporarily  suspending  transmission,  preparing 
for  retransmission  of  possibly  lost  data,  or  any  other  number  of 
actions. 

An  "early  reroute"  option  is  available  in  the  fault  recovery 
state  to  reroute  the  VPC  immediately  after  the  type  1  cell  is  sent. 
Doing  so  has  the  advantage  of  potentially  reducing  the  number  of 
cells  buffered  or  lost.  The  disadvantage  is  that  if  resources  are 
not  available  downstream  congestion  problems  may  occur  for  other 
VPCs  traversing  the  same  nodes.  Evaluating  the  ramifications  of 
this  option  is  beyond  the  scope  of  this  paper  and  is  for  future 
research.  By  contrast  the  "late  reroute"  option  waits  on  the 
acknowledgement  from  the  destination  before  rerouting. 

The  activate  alternate  oath  recmest  state  is  entered  upon 
receipt  of  a  type  1  0AM  cell.  The  type  1  cell  arrives  over  the 
user  VPC  but  is  distinguished  by  a  reserved  VCI  or  cell  type 
indicator  or  both  [8].  The  receiving  node  activates  the 
resources  requested  by  the  type  1  cell  if  possible  and  checks  the 
destination  address  contained  in  the  data  field  of  the  cell  to  see 
if  it  is  the  end  node  for  that  VPC.  If  it  is  the  destination  node 
and  successfully  allocated  resources  to  the  VPC,  it  issues  a  type 
3  cell  along  the  reverse  VPC  defined  in  Figure  5.  If  is  not  the 
destination  node,  it  sends  the  cell  along  the  next  link  in  the 
alternate,  forward  VPC.  If  it  can  not  allocate  the  requested 
resources  to  the  alternate  VPC,  it  will  issue  a  type  3  cell  along 
the  reverse,  alternate  VPC  with  an  indication  that  the  alternate 
path  has  also  failed.  Figure  4  shows  node  lA  entering  the  activate 
alternate  path  request  state,  successfully  allocating  bandwidth, 
and  sending  the  type  1  cell  along  the  alternate,  forward  VPC. 


3-13 


When  a  node  receives  a  type  3  cell,  it  enters  the  activate 
alternate  path  indication  state.  Figure  4  shows  node  lA  receiving 
a  type  3  cell  from  downstream,  which  would  occur  at  some  time  after 
forwarding  the  original  type  1  cell  on  to  node  D,  and  forwarding 
the  cell  to  node  I.  If  the  data  field  of  the  cell  contains  an 
indicator  that  the  alternate,  forward  VPC  was  successful  node  I 
immediately  sends  a  second  type  2  cell  to  node  S  indicating  that 
the  alternate,  forward  path  was  successful.  If  any  node  receives 
a  type  3  cell  with  an  indication  that  the  alternate  VPC  failed,  it 
immediately  deallocates  the  alternate  VPC  and  sends  the  cell  back 
along  the  reverse  alternate  VPC.  The  type  3  cell  contains  the 
destination  address  of  node  I  so  that  node  I  will  know  the  status 
of  the  alternate  path  and  will  not  forward  the  cell. 


DETERMINISTIC  PERFORMANCE  ANALYSIS 

Parametric  equations  of  performance  of  the  FAR  protocol  have 
been  derived  for  the  representative  environment  depicted  in  Figure 
2.  Two  performance  metrics  are  determined,  the  time  to  affect 
early  reroute  and  the  time  to  affect  late  reroute.  A  discussion 
of  the  resulting  equations  is  beyond  the  scope  of  this  paper.  For 
this  analysis,  a  VPC  5000  km  long  was  assumed  consisting  of  six, 
equally  spaced  ATM  nodes.  Node  one  is  the  source  end-point,  node 
6  is  the  destination  end-point.  The  VPC  fails  3500  km  from  node  1 
placing  the  fault  500  km  downstream  of  node  4,  or  half-way  between 
nodes  4  and  5.  This  makes  node  4  responsible  for  recovery.  A 
reverse  direction  VPC  is  assumed  between  node  1  and  node  6.  An 
alternate  path  consists  of  node  7  being  equidistant  between  nodes 
4  and  6.  All  links  are  equidistant  at  1000  km.  An  estimate  is 
made  of  the  number  of  clock  cycles  required  to  perform  the  protocol 
operations,  and  the  propagation  speed  is  assumed  to  be  0.8c.  All 
links  assume  OC-3  capacity  yielding  148.608  Mbps  and  a  cell 
transmission  delay  of  2.853  /xsec.  Propagation  delay  between 
switches  is  4.167  msec. 
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Table  I.  Selected  results  of  deterministic  analysis. 
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The  results  of  the  FAR  analysis  are  compared  with  the  end-to- 
end  (ete)  protocol  that  relies  on  the  end-points  only  to  recover. 
The  nominal  case  assumed  two  forward  VPCs,  both  5000  km  long  with 
one  designated  as  the  alternate  path.  Various  parameters  were 
varied  and  the  results  are  shown  in  Table  I.  In  short,  the  FAR 
protocol  (internal  rerouting)  performs  significantly  better  than 
rerouting  performed  at  the  network  edges.  The  protocol  is 
insensitive  to  processing  time  unless  the  nodes  are  very  close 
together  as  in  a  LAN  configuration. 

For  VPC  lengths  ranging  from  5  km  to  77000,  the  FAR  protocol 
performs  two  to  three  times  faster  than  end-to-end  recovery.  The 
distance  from  a  node  to  a  fault  is  one  of  the  prime  determinants  in 
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time  to  reroute.  The  closer  the  node  spacing,  the  faster  the  fault 
will  be  detected.  The  end-to-end  and  the  FAR  protocols  converge 
when  a  VPC  consists  only  of  the  two  endpoints.  Conversely,  for  a 
VPC  traversing  several  nodes,  the  FAR  method  performs 
proportionally  faster. 

Another  trend  is  for  the  FAR  protocol  to  perform  much  better 
if  the  fault  occurs  near  the  downstream  end  of  the  VPC  rather  that 
the  transmitting  end.  However,  if  the  early  reroute  option  is 
used,  fault  location  has  no  effect  on  performance.  Transmission 
speed  was  decreased  to  10  Mbps  having  little  effect  on  time  to 
reroute  with  the  exception  of  short  VPCs  (LANs) .  At  speeds  of  10 
Mbps,  performance  for  the  FAR  protocol  remained  proportionally 
faster.  All  methods  had  a  sharp  increase  in  reroute  time  due  to 
transmission  delay  becoming  significant  relative  to  propagation 
delay. 


OPNET  ANALYSIS 

An  OPNET  simulation  of  the 
FAR  protocol  was  implemented  to 
confirm  the  correctness  of  the 
protocol,  and  to  validate  the 
results  of  the  deterministic 
performance  analysis.  OPNET  is 
an  event  driven  simulator  which 
provides  general  purpose 
routines  for  creating  and 
destroying  packets, 
determination  of  time  delays,  as 
well  as  for  user  defined 
functions  code  written  in  proto- 
C.  Protocols  in  OPNET  are 
modeled  by  a  FSM,  with  Figure  3 
showing  one  developed  for  the 
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Figure  7.  OPNET  model  of 
node. 


ATM 


utilized  in  this  simulation. 


FAR  protocol .  FSMs 

associated  with  processes  which 
are  linked  to  each  other  via 

streams  into  the  next  highest 
level  in  the  OPNET  hierarchy, 
nodes.  Figure  6  and  Figure  7 
show  nodes  of  the  FAR 

transceiver  and  ATM  switch.  An 
OPNET  function  named 
op_pk_send ( index)  is  available 
to  send  a  packet  over  a  stream 
with  an  identifier  of  index. 

OPNET  is  responsible  for 
delivering  the  packet  to  the  FSM 
(process)  at  the  receiving  end 
of  the  stream.  The  pre-defined 
processes  Queues,  point-to- 
point  transmitters  and 
receivers,  and  sources  are 


Figure  8  shows  the  subnet  of  transceiver  and  ATM  node  type 
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nodes  used  in  the  FAR  model  configuration.  The  links  are 
representations  of  physical  links  for  which  propagation  delay  and 
bit  error  rate  (BER)  can  be  specified.  Propagation  delay  was  set 
at  4.1667  Msec  (for  a  1000  km  link)  and  BER  was  not  used.  Three 
cases  were  run,  1  km  internodal  spacing,  1000  km  internodal  spacing 
and  38500  km  internodal  spacing  (3  nodes) .  These  cases  emulate  a 
LAN,  WAN  and  satellite  system  respectively.  Delay  to  reroute  once 
notified  of  a  fault  was  measured  and  is  in  excellent  agreement  with 
the  analytical  results  except  for  the  LAN  case.  This  is  because 
the  OPNET  simulator  handles  each  cell  several  more  times  than  was 
assumed  in  the  deterministic  analysis.  Thus  the  difference  is  not 
noticeable  unless  propagation  delay  is  very  small  as  in  the  LAN 
case.  The  differences  in  the  to  analyses  corresponds  to  the 
difference  in  the  number  of  times  each  cell  is  handled  in  the  two 
analyses.  Each  time  a  cell  is  handled  adds  2.85  ^sec  to  its  total 
delay  and  each  ATM  node  in  the  model  handles  the  cell  up  to  six 
times  (receive,  transmit,  and  4  queueings)  while  the  deterministic 
analysis  assumed  only  one  handling. 
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CONCLUSIONS 


The  authors  have  developed  an  original,  ATM  compliant 
rerouting  protocol,  FAR,  the  Fast  ATM  Rerouting  protocol.  It 
exhibits  robust  performance  over  a  wide  range  of  parameters  such  as 
processing  speed,  transmission  speed,  node  spacing  and  VPC  length. 
The  protocol  has  been  implemented  and  verified  using  the  OPNET 
simulator  and  compliments  and  ties  together  other  RL/C3D  research 
efforts. 
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Abstract 

The  impact  of  network  topology  on  the  performance  of  a  distributed  processing 
environment  was  evaluated  through  ejqjerimentation  and  simulation.  Experimental 
runs  of  the  JDL  experiment  on  Cronus  were  used  to  collect  traffic  data  for  use 
in  determining  the  limitations  of  Ethernet -based  networks.  Discrete-event 
simulation  and  analytical  modeling  were  used  to  estimate  the  performance  of 
heavily- loaded  Ethernet  and  FDDI  networks. 
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EVALUATION  OF  NETWORK  TOPOLOGY  IN  A  DISTRIBUTED  PROCESSING  ENVIRONMENT 


Kamal  Jabbour  and  Francis  X.  Reichmeyer 


1.  INTRODUCTION 

The  attractiveness  of  distributed  computing  on  networks  of  heterogeneous 
workstations  has  created  an  interesting  class  of  problems.  The  communications 
networks  and  protocols  used  in  such  applications  were  originally  designed  with  a 
different  usage  in  mind,  primarily  reliable  packet  transport  over  unreliable  and 
low-capacity  links.  Distributed  operating  systems,  such  as  BBN's  Cronus,  use 
standard  technologies  like  TCP/IP  over  Ethernet,  and  as  a  result  suffer  from 
their  severe  limitations. 

In  this  research  we  looked  at  distributed  processing  over  heterogeneous  systems 
and  simulated  the  expected  benefit  from  using  new  technologies.  In  particular, 
we  evaluated  the  performance  of  ~he  JDL  (Joint  Director  of  Labs)  experiment  over 
a  group  of  Ethernet  networks,  characterized  its  data  traffic,  and  projected  the 
potential  in^rovement  from  using  FDDI  networks. 

This  report  starts  with  a  brief  introduction  to  the  JDL  experiment  and  the 
Cronus  distributed  operating  system,  followed  by  an  outline  of  the  data 
collection  strategy  and  a  characterization  of  the  JDL  traffic.  The  results  are 
generalized  to  provide  a  generic  characterization  of  the  packet  traffic  for 
typical  distributed  computing  applications,  A  computer  simulation  of  the 
performance  of  an  Ethernet -based  system  under  increasingly  heavy  load  is 
presented  next,  followed  by  an  analytical  model  of  an  FDDI-based  system.  The 
report  concludes  by  identifying  unanswered  questions  and  topics  for  future 
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research,  dealing  both  with  the  chosen  topology  and  the  transport  protocol. 

2.  CRONUS  AND  THE  JDL  EXPERIMENT 

In  this  section  we  introduce  briefly  the  JDL  Experiment  and  the  distributed 
operating  system  environment,  Cronus,  on  which  it  runs.  The  JDL  Experiment  was 
used  as  a  sample  distributed  processing  application  for  the  purpose  of 
characterizing  network  traffic,  and  subsequently  extrapolating  such  traffic 
characteristics  to  other  classes  of  distributed  applications. 

2.1  Cronus 

Cronus  is  a  software  environment  for  the  development  and  operation  of 
distributed  computing  applications  [1].  Cronus  was  developed  by  BBN  Systems  and 
Technologies  of  Cambridge,  MA,  under  contract  with  Rome  Air  Development  Center 
(Rome  Laboratory)  in  the  early  1980s.  The  Cronus  environment  runs  at  the  user 
level,  above  the  operating  system,  of  several  different  host  hardware  and 
operating  system  combinations.  It  uses  an  object-oriented  architecture  within 
which  every  system  resource  and  application-specific  resource  is  viewed  as  a 
typed  object  under  the  control  of  an  object  manager.  An  object  manager 
implements  all  the  operations  that  correspond  to  a  particular  object  type. 

In  the  Cronus  environment,  client  and  object  managers  need  not  reside  on  the 
same  machine,  and  operations  can  be  invoked  across  machine  boundaries.  A  client 
process  does  not  need  to  know  the  specific  locations  of  objects  and  managers,  as 
Cronus  automatically  routes  operation  invocations  to  the  appropriate  manager. 

2.2  The  JDL  Experiment 

The  Trl-Servlce  Distributed  Technology  Experiment  was  initiated  as  an 
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unclassified  activity  by  the  Distributed  Processing  Subconunittee  of  the  Joint 
Directors  of  Laboratories  (JDL)  [2].  The  concept  of  the  experiment  centers 
around  a  low-cost  distributed  computer  system  application  at  three  sites;  Naval 
Command,  Control,  Ocean  Systems  Center  RDT&E  Division  (NRaD),  San  Diego,  CA; 
Rome  Laboratory  (RL),  Rome,  NY;  and  Communications  and  Electronics  Command 
(CECOM),  Fort  Monmouth,  NJ.  The  distributed  confuting  environment  Cronus  was 
chosen  for  the  experiment. 

The  JDL  Experiment  consists  primarily  of  eight  managers,  exchanging  data  in  a 
packet  switched  mode  through  Cronus  by  using  UDP  and  TCP/IP.  The  operations  of 
these  managers  is  beyond  the  scope  of  this  report,  since  our  interest  lies  in 
characterizing  the  data  traffic  generated  by  the  experiment. 

The  JDL  experiment  can  be  run  in  a  one-site  (RL),  two-site  (RL  and  CECOM),  or 
three-site  (EIL,  NRaD  and  CECOM)  configuration.  Computers  at  these  cites  are 
primarily  workstations  connected  locally  with  Ethernet  LANs.  The  LANs  are  in 
turn  interconnected  through  T-1  links  over  terrestrial  wideband  links. 


3.  TRAFFIC  CHARACTERIZATION 

We  conducted  local  (RL-only)  JDL  runs  separately  under  light  network  traffic  and 
under  relatively  heavy  network  traffic,  as  well  as  multi-site  runs  under  normal 
prevailing  traffic.  Etherfind,  a  network  monitoring  software  provided  with  the 
SUN  OS,  was  used  to  capture  the  packet  headers  on  the  local  network  for  use  in 
traffic  characterization.  A  major  limitation  of  the  software/hardware  is  that 
workstations  keep  time  to  a  resolution  of  10  msec,  thereby  preventing  more 
precise  measurement  of  delay. 
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3.1  Packet  Size  Distribution 


A  single-site  JDL  run  generated  cibout  24,137  packets  exchanged  between  the 
various  computers.  The  majority  (23589  or  98%)  of  these  packets  used  TCP,  while 
the  remaining  548  (2%)  used  UDP.  These  proportions  changed  only  slightly  (94%  vs 
6%)  in  multi-site  runs.  These  packets  represent  collectively  fewer  than  8,000 
transactions.  By  using  TCP,  and  its  inherent  inefficiency,  each  transaction 
requires  a  minimum  of  3  packets:  request,  reply,  acknowledgement. 

A  histogram  of  Ethernet  packet  size  distribution  shows  a  basic  tri-modal 
distribution  at  60  bytes  (7675  packets)  213  bytes  (6971)  and  239  bytes  (6971) 
(Fig  4-1).  The  largest  packet  is  619  bytes,  well  below  Ethernet's  packet  size 
limit.  Thus  we  have  observed  no  fragmentation. 

It  may  be  useful  at  this  point  to  discuss  TCP  packet  structure  to  put  the  above 
packet  sizes  in  perspective.  A  typical  TCP  packet  consists  of  a  14-byte  Ethernet 
header,  a  20-byte  IP  header,  a  20-byte  TCP  header,  and  up  to  6  bytes  of  padding 
to  achieve  a  60-byte  packet.  Thus  a  1-byte  acknowledgement  carries  an  additional 
59  bytes  of  overhead  in  a  60-byte  packet.  Similarly  the  239-byte  packets  consist 
of  185  bytes  of  data  and  54  bytes  of  overhead. 

In  analyzing  JDL  traffic,  we  looked  at  two  sample  transactions:  one  dealing  with 
the  Timer  Manager  (running  on  Mensa),  and  the  second  with  the  Target  Filter 
Manager  (running  on  Orion). 

The  Timer  Manager  accounts  for  the  bulk  of  the  network  traffic  in  JDL,  namely 
20,904  out  of  24,137  packets.  A  typical  timing  transaction  consists  of  a  request 
(213-byte  packet),  a  reply  (239-byte  packet)  after  about  20  ms,  and  an 
acknowledgement  ( 60-byte  packet ) : 
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Time 

bytes 

source 

destination 

srcport 

dstport 

46.72 

239 

TCP 

janus.rl.af.mil 

mensa.rl.af.mil 

1433 

2611 

46.74 

213 

TCP 

mensa.rl.af.mil 

janus.rl.af.mil 

2611 

1433 

46.92 

60 

TCP 

janus.rl.af.mil 

mensa.rl.af.mil 

1433 

2611 

The  Target  Filter  Manager  responds  to  requests  for  data  and  accesses  the  target 
database.  A  typical  local  transaction  consists  of  a  request  (387  bytes),  an  ack 
(60  bytes)  after  about  SOms,  a  reply  (219  bytes)  after  about  2.52  sec,  and  a 
final  ack  (60  bytes)  after  about  80  ms: 


Time 

bytes 

source  destination 

srcport 

dstport 

692.20 

387 

TCP  phobos.rl.af .mi  orion.rl.af.mil 

1154 

2611 

692.28 

60 

TCP  orion.rl.af.mil  phobos.rl.af .mi 

2611 

1154 

694.80 

219 

TCP  orion.rl.af.mil  phobos.rl.af .mi 

2611 

1154 

694.88 

60 

TCP  phobos.rl.af .mi  orion.rl.af.mil 

1154 

2611 

In  general,  we  can  characterize  JDL  traffic  as  consisting  of  a  series  of 
transactions,  each  consisting  of  a  request  for  service,  followed  by  an  optional 
acknowledgement,  a  reply,  and  a  final  acknowledgement.  Acks  are  60  bytes  long, 
while  data  packets  range  from  76  to  619  bytes,  with  a  predominance  in  the 
200-400  byte  range. 

3.2  Time  Domain  Characterization 

In  the  time  domain,  ack  packets  are  generated  within  10-100  ms  (Fig  4-3),  while 
manager  service  times  range  from  a  few  ms  to  several  seconds  (Fig  4-4).  Service 
times  consist  of  two  components:  communication  time  and  processing  time.  Under 
very  light  load  conditions,  communication  times  are  primarily  data  transmission 
times,  of  the  order  of  100-500  microseconds.  Since  a  single-site  JDL  run 
requires  the  transmission  of  about  24,000  packets  over  a  period  of  an  hour,  or 


4-7 


less  than  7  packets  per  second,  such  a  run  generates  a  very  light  network  load 
and  has  a  negligible  effect  on  network  performance. 

3.3  Additional  Data 

In  addition  to  the  JDL  Experiment  described  above,  experimental  runs  of  two 
other  distributed  applications  were  conducted.  The  Australian  Experiment  [3]  was 
run  at  the  DISE  facility  at  Rome  Lab,  and  an  ISIS  run  was  performed  at  the 
Software  Engineering  Lab  at  Syracuse  University  (Fig  4-2).  These  applications 
featured  a  multimedia  component  and  the  corresponding  large  packets  for  video 
screen  updates.  Whereas  the  JDL  Experiment  had  primarily  small  packets, 
multimedia  applications  require  the  transfer  a  large  files,  resulting  in  packet 
fragmentation. 

3.4  Background  Traffic 

As  discussed  in  the  previous  sections,  JDL  traffic  alone  offers  a  very  light 
load  to  an  Ethernet.  Therefore  the  performance  of  a  JDL  run,  or  any  distributed 
application  for  that  matter,  is  directly  affected  by  the  background  traffic  on 
the  network.  Therefore  we  have  tried  to  characterize  experimentally  the 
background  traffic  by  collecting  a  significant  amount  of  network  traffic  data. 

Based  on  the  data  collected,  we  have  characterized  the  packet  arrival  process 
with  an  exponential  distribution  (Fig  4-5).  Further,  we  have  observed  that  the 
packet  size  distribution  contained  a  significant  peak  (about  half  the  packets) 
at  60  bytes,  a  small  peak  at  1514  bytes  (fragments  at  maximum  Ethernet  packet 
size),  and  an  otherwise  random  distribution  between  60  and  566,  decaying 
exponentially  beyond  that  (Fig  4-6). 

An  analysis  of  the  traffic  patterns  on  both  the  Syracuse  University-  SELab  and 
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Rome  Lab-DISE  networks  suggests  that  the  majority  of  the  traffic  is  generated  by 
remote  login  and  telnet  users.  In  a  remote  login  session,  each  keystroke 
generates  3  TCP  packets,  each  consisting  of  60  bytes  (1  packet  contains  the  key 
stroke,  a  return  packet  carries  the  echo  back,  and  a  third  ack  packet  completes 
the  3-way  handshake).  At  the  other  extreme,  Network  File  Systems  (NFS)  traffic 
consists  primarily  of  large  files  fragmented  into  packets  of  1500  bytes  (unless 
a  lower  maximum  packet  size  is  set  at  internet  gateways). 


4.  GENERIC  TRAFFIC  CHARACTERIZATION 

The  distributed  processing  applications  described  earlier  allow  us  to  develop  a 
generic  model  for  characterizing  network  traffic.  Such  a  generic  model  will  be 
application-independent  and  allows  a  better  understanding  of  the  impact  of  the 
communications  environment.  We  will  assume  a  packet-switched  environment  where 
all  communication  between  processes  uses  packets  (datagreuns) . 

4.1  Basic  Transaction 

The  generic  model  assumes  that  a  distributed  processing  application  consists  of 
a  group  of  service  managers  located  on  various  computers.  A  request  for  service 
can  originate  from  any  host  on  the  network,  and  may  be  preceded  by  an  initial 
broadcast  to  locate  the  server.  We  will  define  a  transaction  as  a  complete  set 
of  packet  transmissions  consisting  of  a  request  for  service,  an  optional 
acknowledgement,  a  reply  packet,  and  a  final  acknowledgement.  Acknowledgements 
are  required  to  complete  the  3-way  handshake  necessary  in  an  unreliable  network 
environment  (such  as  using  TCP/IP),  The  assumption  of  a  reliable  (low  error 
rate,  eg:  optical)  network  allows  the  use  of  a  more  efficient  protocol. 
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4.2  Packet  Generat ion 


We  assume  that  the  packet  generation  intervals  are  exponentially  distributed. 
In  an  ideally  active  network  supporting  several  simultaneous  distributed 
processing  applications,  we  assume  that  all  nodes  act  as  clients  as  well  as 
servers,  therefore  all  nodes  have  similar  packet  generation  characteristics. 
The  value  of  the  mean  interval  determines  the  loading  level  of  the  network.  For 
example,  a  JDL  Experiment  generates  about  7  packets  per  second  on  average. 

Similarly,  background  traffic  (generated  by  non-distributed  processing  users) 
can  be  modeled  with  an  exponential  interarrival  distribution  with  a  variable 
mean  that  determines  the  background  load  on  the  network. 

4.3  Packet  Size  Distribution 

The  distributed  processing  applications  that  we  examined  rely  predominantly  on 
packets  of  a  few  hundred  bytes  (200-600  bytes)  for  their  communication.  A 
handshake  packet  requires  a  minimum  of  1  byte,  burdened  by  up  to  60  bytes  of 
header  information  (addressing,  sequencing,  error  detection...).  Images, 
high-resolution  graphics,  and  multimedia  data  require  large  packets  of  several 
kbytes  (typically  4-20  kbytes). 

To  simplify  our  model,  we  define  packet  sizes  excluding  transport  layer  headers 
and  independent  of  network  limitations  (Ethernet  limit  is  1,500  bytes,  FDDI 
limit  is  4,500  bytes).  We  assume  that  distributed  processing  traffic  consists  of 
3  packet  sizes,  namely  60  bytes  (handshake  packets),  300  bytes  (requests  and 
replies),  and  9,000  bytes  (for  graphics).  The  relative  distribution  of  packets 
among  these  three  sizes  is  left  as  a  model  variable,  allowing  the  specification 
of  a  range  of  applications. 


4-10 


4.4  Packet  Transmission  Time 


Packet  transmission  time  is  a  primary  variable  of  interest  in  modeling  the 
communication  characteristics  of  a  distributed  processing  environment.  It 
depends  primarily  on  the  type  of  local  area  network  (Ethernet,  FDDI,  ATM),  the 
internetworking  topology  for  multi-site  applications,  and  the  transport  protocol 
used. 

4.5  Processing  Time 

The  final  component  of  the  generic  traffic  model  is  the  remote  service  time,  or 
manager  processing  time  at  a  remote  site.  This  is  the  time  it  takes  a  service 
manager  to  complete  the  necessary  processing  and  generate  a  reply  to  a  request 
for  service.  In  the  above  applications,  this  processing  time  ranged  from 
sub-millisecond  (request  for  timing  information)  to  several  seconds.  We  assume 
that  the  processing  time  is  exponentially  distributed,  with  the  mean  determining 
the  computational  intensity  of  a  given  application. 


5.  COMPUTER  SIMULATION  OF  AN  ETHERNET  BASED  SYSTEM 

Currently,  the  JDL  experiment  runs  on  three  Ethernet  LANs,  one  at  each  of  the 
labs  involved  in  the  project,  as  described  above.  In  this  section  we  describe 
the  simulation  of  the  local  (RL  only)  JDL  experiment.  The  simulation  was 
performed  to  observe  the  degradation  of  Ethernet  under  heavy  loads.  Since  it  is 
not  possible  to  conduct  multiple  JDL  runs  simultaneously  on  the  network,  and 
since  one  JDL  experiment  by  itself  does  not  generate  enough  network  traffic  to 
cause  a  degradation  of  the  LAN,  simulation  provides  insight  that  cannot  be 
gained  through  experimentation  alone. 
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The  simulation  program  is  written  in  C  using  the  SMPL  library  of  subroutines  for 
discrete-event  and  random  variate  simulation  [4],  We  consider  an  Ethernet  LAN  of 
32  workstations  connected  with  a  total  cable  length  of  2.5  kilometers.  Each 
station  is  considered  to  be  identical  in  terms  of  traffic  generation. 
Therefore,  LAN  access  is  modeled  as  a  single  queue  of  size  32,  which  accepts 
generated  packets  and  releases  them  on  a  first  in  first  out  (FIFO)  basis 
according  to  the  Carrier  Sense  Multiple  Access  with  Collision  Detection 
(CSMA/CD)  protocol  of  Ethernet.  If  the  queue  is  full  when  a  new  packet  is 
generated,  that  packet  is  discarded.  Throughput  and  average  delay  are  measured 
as  a  function  of  the  offered  load  to  the  network. 

The  Ethernet  LAN  is  modeled  as  an  open  system,  where  the  total  offered  load  G  to 
the  network  is  the  product  of  the  arrival  rate  L  of  new  packets  (for  all 
stations)  and  the  mean  transmission  time  T  for  a  packet,  G=LT.  The  packet  length 
distribution  determines  the  mean  transmission  time  T  and  the  packet  interarrival 
times  are  obtained  from  an  exponential  distribution  with  mean  1/L.  Both  packet 
length  distribution  and  G  are  supplied  as  inputs  to  the  simulation  by  the  user. 

We  considered  several  different  packet  length  distributions  corresponding  to 
different  distributed  computing  applications  including  JDL,  the  Australian 
experiment,  and  an  ISIS  run,  all  three  with  no  background  traffic.  That  is,  the 
only  packets  on  the  LAN  are  those  generated  by  the  experiments.  For  JDL,  the 
packet  distribution  is  as  follows: 

60%  §  60  bytes,  30  %  @  300  bytes,  and  10  %  §  1500  bytes. 

The  Australian  experiment  is  more  computing  intensive,  having  a  distribution  of: 
60%  §  1500  bytes,  30%  §  300  bytes,  and  10  %  @  60  bytes. 

The  1500-byte  packets  represent  the  fragments  of  the  larger  packets  (mostly  8000 
bytes)  produced  by  this  application.  The  ISIS  run,  simulating  an  application 
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using  the  ISIS  distributed  operating  system,  has  the  distribution: 

57%  @  120  bytes,  32%  @  1500  bytes,  and  11%  §  60  bytes. 

We  also  simulated  the  JDL  and  Australian  experiments  with  background  traffic. 
In  all  cases,  the  interarrival  times  were  computed  from  offered  load  and 
transmission  time  for  the  mean  packet  size  (L=G/T). 

The  attached  figures  show  plots  of  throughput  and  delay  versus  offered  load  (Fig 
4-7,  4-8).  Average  packet  delay  is  measured  from  the  time  a  packet  is  generated 
to  the  time  it  is  received.  Throughput  is  defined  as  the  part  of  the  channel 
utilization  used  for  successful  data  transmission,  based  on  the  total  packet 
length,  including  the  preamble  (ethernet  header)  and  data  field. 

6.  PERFORMANCE  ANALYSIS  OF  AN  FDDI  BASED  SYSTEM 

An  FDDI  network  was  simulated  analytically  with  the  same  traffic  characteristics 
used  above  for  Ethernet.  We  assumed  a  cable  length  of  2.5  km,  32  stations  spaced 
equally  around  the  ring,  and  ring  access  based  on  the  MAC  layer.  The  Target 
Token  Rotation  Time  was  assumed,  small  enough  to  permit  the  transmission  of  only 
one  packet  per  token  acquisition.  Finally,  large  data  packets  were  fragmented 
to  4500  bytes  as  opposed  to  1500  bytes  in  the  case  of  Ethernet. 

An  analytical  model  of  the  FDDI  ring  was  developed  to  estimate  network 
throughput  and  delay  in  a  simulated  distributed  processing  environment.  We 
assumed  that  several  applications  were  running  simultaneously,  with  every 
station  hosting  a  number  of  managers.  Therefore  we  assumed  a  homogeneous  ring, 
as  far  as  traffic  patterns  are  concerned  [5]. 

Throughput  is  shown  in  Fig  4-9.  Offered  load  refers  to  the  fraction  of  total 
network  capacity.  Since  FDDI  provide  a  deterministic  data  transmission  medium, 
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representing  throughput  in  number  of  packets  per  second  provides  more  useful 
information  than  presenting  it  as  a  fraction  of  offered  load  (the  latter 
relationship  is  linear  until  meocimura  load  is  reached,  then  flattens  out). 

Average  packet  delay  is  shown  in  Fig  4-10,  and  included  both  the  queueing  delay 
Inside  individual  nodes  and  the  transmission  delay.  The  wait  queue  Inside  the 
node  was  modeled  as  a  M/D/1  queue,  assuming  a  Markov  arrival  rate,  and  a 
deterministic  service  rate. 

Comparing  the  performance  of  Ethernet  and  FDDI  shows  that  FDDI’s  degradation 
under  heavy  loads  is  more  graceful  than  that  of  Ethernet.  In  addition  to  the 
obvious  order  of  magnitude  in  transmission  speed  (100  Mbps  for  FDDI  vs  10  Mbps 
for  Ethernet),  the  deterioration  in  Ethernet's  performance  is  caused  by 
collision,  whereas  FDDI's  deterioration  comes  from  the  queueing  delays  within 
each  node . 

At  low  utilization,  such  as  10%  of  offered  load,  it  is  not  surprising  that 
Ethernet's  performance  is  as  good,  if  not  better,  than  FDDI's.  With  an 
estimated  10-bit  delay  in  each  FDDI  station,  it  takes  a  long  time  for  a  token  to 
rotate  idly  around  a  ring. 


7.  FUTURE  EFFORT 

Our  work  has  shown  that  current  technology  is  well-suited  for  distributed 
applications  of  the  class  of  the  JDL  or  Australian  experiment,  which  offer  a 
small  load  to  an  Ethernet.  However,  multimedia  intensive  applications  generating 
traffic  several  orders  of  magnitude  larger  than  JDL  cannot  run  on  existing 
networks,  and  necessitate  new  topologies  and  transport  protocols.  In  addition, 
the  intact  of  the  network  architecture  on  the  traffic  characteristics  requires 
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further  study.  The  use  of  an  FDDI  network,  or  an  FDDI-ATM  interconnection,  with 
the  resulting  deterministic  delay  characteristics,  will  reduce  significantly  the 
number  of  smaller  packets  and  result  in  a  larger  number  of  large  packets. 

In  future  effort  we  propose  to  examine  the  performance  of  two  optical  networks. 
The  first  network  uses  multiple  FDDI  rings  interconnected  with  high-speed 
point-to-point  optical  links  through  a  single  central  ATM  switch.  The  second 
network  uses  a  hierarchical  ATM  architecture  where  the  computers  are  connected 
to  local  ATM  switches  which  are  in  turn  connected  to  a  central  ATM  switch.  We 
propose  to  compare  the  delay  and  throughput  achieved  by  these  configurations  and 
determine  their  suitability  for  distributed  processing. 
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Abstract 

Quantized-state  adaptive  algorithms  are  proposed  for 
adaptive  multibeam  antenna  signal  processing  and  are  compared  to 
the  known  sample  matrix  inversion  and  the  correlation  feedback 
algorithms  of  the  adaptive  antenna  signal  processing  literature 
in  terms  of  various  performance  measures  of  interest.  Such 
performance  measures  include  the  interference  rejection 
capability,  the  computational  requirements,  convergence  rate, 
numerical  robustness  for  implementation  with  finite  precision 
arithmetic  and  fraction  of  the  area  coverage  lost  due  to  nulling 
of  interference  sources.  Simulation  results  show  that  the 
quantized  state  algorithms  proposed  earlier  by  the  author  for 
somewhat  different  signal  processing  problem  of  adaptive 
equalization,  along  with  some  very  significant  modifications  such 
as  inclusion  of  an  appropriate  deadzone  in  the  quantization 
process,  multiple  thresholds  and  a  novel  method  of  step  size 
selection  etc.,  constitute  powerful  algorithms  for  the  multibeam 
antenna  signal  processing  problems.  The  proposed  schemes  are 
shown  to  be  superior  to  other  known  techniques  in  terms  of  most 
of  the  performance  criteria. 
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ADAPTIVE  ALGORITHMS  FOR  ANTENNA 

array  signal  processing 


Rajendra  Kumar 


1.  Introduction 

A  satellite-based  adaptive  antenna  must  be  designed  to 
combat  tnterfetenoe  sources  including  intentional  Jammers  located 

of^thT*  pattern 

afforded  b  "“‘  accomplished  by  the  high  resolution 

y  a  multiple  beam  antenna  with  a  relatively  large 
aperture  size  and  a  significant  number  of  beams  (providing  the 
ssired  number  of  degrees  of  freedom)  [l].  The  outputs  of  these  N 
beams  are  linearly  combined  in  an  adaptive  manner  so  as  tl 
ntroduce  nulls  in  the  direction  of  interference  sources  while 
Minimizing  the  inevitable  antenna  gain  reduction  in  any  other 
ireotion  within  the  quiescent  beam  width.  To  achieve  the 

ficiently  rapid  convergence  rate  to  adapt  in  a  dynamic 
scen„io  such  as  the  presence  of  blinhing  Jammers,  have 
reasonable  computational  requirements  in  terms  of  the  actual 
number  of  arithmetic  operations  required  per  update  of  the 
algorithm.  Apart  from  the  number  of  operations,  it  is  equally 
mportant  that  the  algorithm  be  insensitive  to  arithmetic 

tr;:;:h;r^‘’  -  -  -  -  - 

the  adaptive  algorithms  of  the  literature  are  based  on 

ZLrT  T  the  array  gain  (slgnal-to- 

interference-pius  noise  ratio)  for  a  signal  source  in  the 

irection  of  the  peak  of  the  antenna  gain  as  specified  by  the 

steering  vector[2,31.  The  specified  steering  vector  or  the 

quiescent  weight  vector  actually  produces  -enough'  directive  gain 

aLe“nce"of  “  the 

absence  of  jammers  -  essentially  a  property  of  the  MBA 
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antenna . 


However,  this  may  not  be  the  case  with  the  adapted  pattern  in  the 
presence  of  jammers  and  the  'optimal'  algorithm  termed  SMI 
(sample  matrix  inversion)  [4]  may  be  sub  optimum  with  respect  to 
the  criterion  of  area  covered  by  the  adapted  beam  .  In  terms  of 
implementation,  the  SMI  algorithm  requires  order  complex 
multiplications  and  additions  for  each  data  update  and  an  NxN 
matrix  inversion  for  each  update  of  the  weight  vector.  For 
reasonably  large  value  of  N  (say  91),  this  may  imply  about  10,000 
multiplications  for  each  data  update.  In  a  dynamic  scenario 
involving  blinking  jammers,  it  may  be  necessary  to  have  data 
update  in  a  fraction  of  jisec  and  thus  very  significant 
computational  requirements  are  involved.  Furthermore,  in 
situations  involving  multiple  jammers  of  varying  power  levels, 
the  estimated  correlation  matrix  becomes  ill  conditioned  and  high 
precision  arithmetic  is  required  to  avoid  numerical  instability. 

A  simpler  version  of  this  algorithm  termed  correlation 
feedback  (CF)  requires  only  order  N  computations.  However,  for 
the  case  of  high  condition  number  of  the  relevant  correlation 
matrix  involved,  the  convergence  is  very  slow  and  may  be 
unacceptable  in  some  dynamic  scenarios  .  Note  that  if  instead  of 
steering  vector,  a  desired  (pilot)  signal  from  the  desired 
direction  is  given,  then  the  CF  algorithm  is  equivalent  to  the 
LMS  algorithm  of  the  literature [4] .Also  both  SMI  and  CF 
algorithms  have  their  steady-state  solutions  (under  assumed 
stationarity  condition)  equal  to  the  steady-state  solution  of  the 
analog  algorithm  of  Applebaum-Howell  [5,6]  derived  on  a  more 
intuitive  basis. 

A  strong  motivation  therefore  exists  to  explore 
algorithms  which  have  convergence  rates  similar  to  that  of  SMI 
algorithm  but  with  much  smaller  computational  requirements,  are 
numerically  robust  and  are  capable  of  providing  better  coverage 
area  than  the  SMI  algorithm.  The  quantized  state  (QS)  algorithms 
proposed  earlier  by  the  author  for  a  somewhat  different  signal 
processing  problem  [7],  with  some  very  significant  modification 
such  as  inclusion  of  an  appropriate  deadzone  in  the  quantization 
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process,  multiple  thresholds  and  a  novel  method  of  step  size 
selection,  etc. ,  are  applied  here  to  the  MBA  signal  processing 
problem.  These  algorithms  only  require  n2  increment /decrement 
operations  (as  against  complex  multiplications  and  additions 
required  for  SMI  algorithm) ,  have  their  convergence  rates  similar 
to  those  of  SMI  and  thus  orders  of  magnitudes  faster  than  CF. 
Also,  the  matrix  involved  in  the  QS  algorithm  has  a  condition 
number  which  is  approximately  equal  to  the  square  root  of  the 
correlation  matrix  involved  in  the  SMI  algorithm  and  thus  the  QS 
algorithms  are  much  more  robust  in  numerical  computations.  It 
may  be  remarked  that  much  more  computationally  intensive 
algorithms  are  often  required  to  alleviate  the  matrix  ill 
conditioning  problem  [8].  These  algorithms  are  based  on  the 
update  of  the  square  root  of  the  correlation  matrix  so  as  to 
effectively  reduce  the  condition  number  (CN)  to  a  value  equal  to 
the  square  root  of  the  original  matrix  but  at  the  cost  of  order 
n3  computations.  The  QS  algorithms  in  contrast  yield  the  same 
advantage  as  afforded  by  these  more  computation  intensive 
algorithms,  yet  at  the  same  time  the  QS  algorithms  actually 
reduce  the  computational  requirement  in  comparison  to  the  direct 
SMI  algorithm  instead  of  an  increase  to  achieve  this  CN 
advantage.  The  proposed  scheme  is  also  superior  to  the  SMI 
algorithm  in  terms  of  antenna  pattern  area  coverage,  although 
there  is  considerable  variability  associated  with  improvement 
with  respect  to  this  criterion. 

2 .  Adaptive  Algorithms 

Figure  1  depicts  the  configuration  for  the  beamformer 
wherein  the  amplified  rf  signals  are  weighted  by  complex  weights 
'*^1/  •••»  combined  to  produce  the  beamformer  output 
{r(t)}  whose  complex  envelope  is  represented  by  z(t).  The  weight 
vector  w=[wi,W2 . .  .wjj]  '  is  adapted  by  the  adaptive  algorithm  so  as 
to  minimize  the  residual  power  at  the  beamformer  output,  subject 
to  a  constraint  on  signal  response  from  the  desired  direction 
specified  by  the  vector  d  or  more  appropriately  the  signal 
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response  for  a  specified  quiescent  weight  vector.  That  is, 

MinEpzpj  =  w“Rw  ;  R  =  e[xx“]  (2.1a) 

Subject  to  d^w  =  1  (2.1b) 

Here  it  is  implicitly  assumed  that  there  are  no  desired  signa’ s 
other  than  in  the  direction  corresponding  to  the  steering  vectcy, 
otherwise  the  algorithm  would  try  to  minimize  the  signal  pou’ir 
from  those  sources  as  well.  One  possible  solution  to  avoid  this 
problem  is  to  incorporate  a  "quiet"  period  during  which  all 
desired  signal  sources  cease  transmission  and  the  beamfomer 
weights  are  "trained"  during  this  period.  This  is  followed  by  a 
signal  transmission  period  during  which  the  beamformer  weights 
are  not  perturbed.  The  obvious  disadvantage  of  this  strategy  is 
that  it  may  not  be  possible  to  null  the  interference  from 
blinking  jammers.  However,  as  will  be  seen  later,  the  QS 
algorithms  can  effectively  shield  the  signal  sources  from  the 
adaptation  algorithm  provided  that  their  signal  power  levels  are 
smaller  than  the  jammer  power  levels  and,  in  these  situations  it 
is  no  longer  necessary  to  have  a  "quiet"  or  "training"  period 
with  QS  algorithm.  In  those  situations  where  power  level 
constraints  are  not  satisfied,  more  sophisticated  strategies  can 
be  devised  to  avoid  signal  cancellation  in  the  absence  of  a 
"training"  period  as  would  be  the  case  with  blinking  jammers. 

Another  strategy  which  can  reduce  the  possible  signal 
cancellation  to  a  limited  extent  is  to  impose  the  following 
quadratic  constraint 

w“w<(^  (2.1c) 

There  are  several  different  but  somewhat  related  ways  in  which 
the  quadratic  constraint  (2.1c)  appears  in  the  adaptive  antenna 
processing  literature  [2,3,9].  Here,  the  rationale  is  that  if 
the  signal  covariance  matrix  Rg  is  changed  by  some  ARs=^  (for 
any  arbitrary  matrix  A  of  norm  1)  then  the  sensitivity  of  the 
array  output  signal-to-interference  plus  noise  ratio  (SINR)  with 
respect  to  ^  is  directly  dependent  on  ||w||.  Constraint  (2.1c) 
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thus  limits  the  sensitivity  of  the  beamformer  output  SINR  to  Rg 
or  equivalently  to  the  distribution  of  signal  sources.  Since  the 
beamformer  does  not  distinguish  between  interference  and  desired 
signals  except  via  (2.1b),  it  is  clear  that  the  reduction  in  SINR 
sensitivity  is  achieved  at  the  cost  of  reduced  capability  to 
eliminate  the  power  of  (weaker)  jammers.  Thus,  in  the  selection 
of  5  in  (2.1c),  there  is  a  tradeoff  between  these  two 
contradictory  requirements.  Minimization  of  the  index  in  (2.1a) 
subject  to  (2.1b)  and  (2.1c)  via  Lagrangian  method  yields  the 
following  solution: 


o_  (R  +  gI)-‘d 

d”(R  +  £l)'d 


(2.2) 


where  s  is  an  implicit  function  of  5  and  a  higher  value  of  e 
results  in  reduced  interference  rejection  capability.  Since  SINR 
is  not  changed  by  an  arbitrary  scaling  of  weight  vector,  the 
denominator  term  in  (2.2)  may  be  dropped  without  changing  the 
beamformer  output  SINR.  The  solution  (2.2)  is  same  as  the 
steady-state  solution  of  the  original  analog  algorithm  of 
Howells-Applebaum  [5,6]  derived  on  a  more  intuitive  basis.  In 
practice,  R  is  unknown  and  (R+el)  may  be  replaced  by  the 
following  recursive  estimate  with  R(0)=el, 


R  =  AR(k-l)  +  (l-A)x(k)x''(k)  +  (l-A)d  ;  0<X<\  ;  k  =  l,2,... 
=  aR'(k)d 


(2.3a) 

(2.3b) 


where  a  is  a  constant  to  ensure  that 


=  K  for  any  constant  K 


independent  of  k.  Note  that  when  R  is  not  a  constant  (x(k) 
process  is  nonstationary)  but  varies  slowly  with  time,  the 
exponential  weighting  factor  X  in  (2.3a)  can  be  appropriately 
adjusted  to  track  this  variation.  It  may  also  be  noted  that 
within  an  appropriate  (dominant)  time  constant  associated  with 
the  variations  of  R,  one  needs  to  compute  (2.2)  or  (2.3b)  only 
once,  however,  one  needs  to  update  (2.3a)  many  times  faster  to 
obtain  reliable  estimate  of  R(k) .  Algorithm  (2.3)  is  referred  to 
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as  sample  matrix  inversion  (SMI)  algorithm. 

Correlation  Feedback  Algorithm 

Correlation  feedback  algorithm  is  a  simpler  reactive 
version  of  (2.2)  and  has  been  extensively  explored  in  the 
literature  [2-6].  In  this  procedure,  the  weight  vector  w  is 
decomposed  as 

w  =  w,+w„  (2.4) 


where  is  the  projection  of  w  on  the  subspace  C  of  the 
parameter  space  wherein  the  linear  constraint  (2.1b)  is  satisfied 
and  Wu  is  the  component  in  the  subspace  orthogonal  to  C-  The  w^ 
component  corresponding  to  the  solution  (2.2)  and  given  by  w^  = 
d/(d^d)  is  independent  of  the  process  x(k)  and  thus  the 
optimization  is  carried  out  with  respect  to  only  w^.  One  of  the 
robust  versions  of  the  several  variations  of  the  resulting 
algorithm  [3,10]  has  the  following  form: 


w(k  + 1)  =  +PJw(k)-/iR(k)w(k)]  (2 . 5a) 

R(k)  =  /lR(k-l)  +  (l-A)x(k)x”(k)  +  (l-/l)fI  (2.5b) 

P,  =  I  -  d(dHd)"ldH  (2.5c) 


A 

In  (2.5),  Pjis  the  projection  operator,  R(0)=fl  is  some  non-zero 
initial  estimate  of  R  and  the  step  size  ^  is  selected  to  be 
sufficiently  small  so  as  to  ensure  convergence  of  the  algorithm. 

A 

Note  that  (2.5)  only  requires  the  update  of  R  (k)  but  does  not 
involve  a  matrix  inversion.  A  computationally  simpler  version  of 
(2.5)  is  obtained  by  substituting  X,=o  and  is  given  by 

w(k  +  l)  =  +PJ(l-/i£')w(k)-/ix(k)z*(k)]  (2.6) 

It  may  be  noted  that  apart  from  the  projection,  algorithm  (2.6) 
does  not  require  any  matrix  operation.  However,  for  a  given  step 
size  fi  (and  therefore  the  convergence  rate),  (2.6)  has  higher 
"algorithm  noise"  and  thus  would  result  in  a  higher  steady-state 
jammer  power  at  the  beamformer  output,  known  as  excess  mean 
square  error  (EMSE)  in  the  literature  [11].  Algorithm  (2.6) 
while  computationally  much  simpler  compared  to  (2.2)  has  the 
disadvantage  of  a  much  slower  convergence  rate  which  is  a 
function  of  |i.  It  may  be  shown  [11]  that  for  convergence. 
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Tr[R]+N£ 

and  the  EMSE  is  the  weighted  sum  of  the  terms  — fxe— where 
X„  denotes  the  nth  eigenvalue  of  R.  It  is  easily  seen  that  for 
high  condition  number  ratio  {^^1 X^)  of  R  (as  is  the  case  with 
the  problem  under  study) ,  the  term  associated  with  the  minimum 
eigenvalue  X^  will  have  very  slow  convergence  resulting  in  an 
overall  slow  convergence  of  the  algorithm.  For  a  typical  example 
X^=i,  £=i,  and  N=7  a  conservative  estimate  of  the 
maximum  value  of  p  is  9.5  x  10“7.  with  this  value  of  |i,  the  slow 
mode  will  converge  as  (1-1.9x10-6)  2k  ^/ith  the  associated  time 
constant  r«l/ (4/^1^)  =  2.6 *10^  and  adaptation  time  of  approximately 
106  samples.  For  higher  values  of  N  and  the  condition  number, 
the  convergence  rate  would  be  even  lower.  In  dynamic  scenarios, 
such  low  convergence  rates  may  be  unacceptable.  On  the  other  hand 
the  error  in  (2.3)  is  primarily  due  to  estimation  error  in  R  and 
thus  convergence  of  MSE  to  its  steady-state  value  will  occur  in 
order  l/(l-X,)  samples.  For  example,  with  X=.95  this  is  only  about 
20  samples.  Quantized  state  schemes  are  now  considered  which  are 
computationally  very  attractive  as  compared  to  SMI  but  have  same 
order  of  convergence  times  as  for  SMI.  In  the  first  instance, 
consider  algorithms  for  the  unconstrained  minimization  problem 
derived  on  the  basis  of  geometric  projection  [12],  which  are  then 
adapted  to  the  constrained  optimization  problem  under 
consideration. 

A  Generalized  Adaptive  Algorithm  —  Geometric  Approach 

Consider  the  beamformer  problem  wherein  a  pilot  signal 
s(k)  from  the  desired  signal  source  is  given  (instead  of  the 
steering  vector  d)  and  the  weight  vector  w  is  selected  to 

minimize  the  MSE  =  E[|s(k)- w»x(k)f].  with  w°  denoting  the  optimum 

solution,  one  obtains, 

s(k)  =  w°“x(k)+v(k)  (2.7) 

where  due  to  the  orthogonality  principle  of  minimization,  v(k)  is 
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orthogonal  to  x(k).If  it  is  assumed  that  x(k)  is  a  Gaussian 
process,  then  v(k)  is  also  independent  of  x(k)  .  Thus,  one  may 
consider  an  equivalent  problem  of  identifying  the  unknown 
parameter  vector  w°  from  the  signal  model  (2.7).  In  the 

geometric  approach  to  the  solution  [12],  given  an  arbitrary 
initial  estimate  w  of  w°,  one  first  projects  w  onto  the 

hyperplane  H)^:  s(k)-w^x(k)  =  o;  so  as  to  minimize  the  weighted 
"norm" 

||w-Wh£  ;Wh€H 

(the  indices  of  H,  wh,  x,  and  B  have  been  dropped  for 

convenience)  with  respect  to  matrices  B(k)  which  satisfy  the 
following  conditions: 

BB*x  =  x  ;  x”B*x  =  l  (2.8) 

where  B*  denotes  Moore-Penrose  pseudo-inverse  of  matrix  B.  The 
optimum  projections  are  simply  given  by 
=w+  B''(k)x(k)  [s(k)-w"x(k)]* 

In  the  second  step,  each  hyperplane  is  represented  by  its 

respective  projection  and  one  then  minimizes  the  following 

sum  to  obtain  termed  the  "center  of  the  mass"  of  the 

hyperplanes.  That  is 

Minimize  Xi||w-WHj_  (2.10) 

for  some  appropriate  sequence  of  matrices  B(k)>o.  A  simpler 
(recursive)  algorithm  is  obtained  if  w^^in  (2.8)  is  replaced  by 

denoting  the  projection  of  w(k-l)  onto  Hj^;.  The  resulting 

recursive  solution  is  given  by 

w(k)  =  w(k  - 1)  +  P(k)B(k)B*(k)x(k)^*  (1^) 

4(k)  =  s(k)-w”(k-l)x(k) 

P'(k)=P'(k-l)+B(k)  ;k  =  l,2,...  (2.11) 

It  is  easily  seen  in  view  of  (2.8)  that  for  the  case  with  either 
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B(k)  =  B(k)  or  B(k)  =  x(k)x“(k),  the  algorithm  (2.11)  reduces  to  the 
recursive  least-squares  (RLS)  algorithm  [13,14]. 
w(k)  =  w(k-l)  +  P(k)x(k)^*(k) 

F'(k)=p-'(k-l)+x(k)x»(k)  (2.12) 

Also  with  B(k)=b(k)I  for  some  scalar  sequence  b  (k)  ,  (2.!;.) 

reduces  to  the  following  generalized  version  of  the  stochasi^ic 
gradient  algorithm  (2.6)  . 

w(k)  =  w(k-l)  +  Kk)B*(k)x(k)^*(k)  (2.13) 

where  y(k)  is  an  appropriate  scalar  sequence  related  to  the  b  (k) 
sequence.  Another  solution  of  most  interest  here  is  achieved  by 
the  following  selection  of  matrix  B  (k)  .  Note  that  the  term 
under  the  summation  in  (2.10)  is  required  to  be  some  appropriate 
measure  of  the  distance  between  a  point  w  and  the  hyperplane 
containing  the  point  with  the  property  that  the  distance  be 

non  zero  if  and  be  zero  otherwise.  A  Hermitian  symmetric 

matrix  B  (k)  satisfying  this  property  is  given  by 

B(k)  =  x(k)x“(k)+x(k)x”(k) 

Re[Xi(k)]  =  sgn{Re[xKk)]} 

Im[Xi(k)]=sgn{Im[Xi(k)]} 

where  x^  (resp.  xj  denote  the  ith  component  of  the  vector  x  (resp. 
X),  and  Re(z)  and  Im(z)  respectively  denote  the  real  and 
imaginary  parts  of  any  complex  variable  z.  It  is  easily  seen  for 
this  selection  of  B  (k)  and  any  arbitrary  vector  q,  that  q^Bq=o 
if  q  is  orthogonal  to  x(k)  i.e.,  q  is  parallel  to  and  qHBq>o 
if  q  has  a  component  in  the  direction  of  x(k)  .  With  this 
selection  of  B (k)  and  B(k)=x(k)x^(k) ,  the  general  algorithm 
(2.11)  reduces  to 

w(k)  =  w(k-l)  +  P(k)  [x(k)+  x“(k)x(k)  (x'^(k)x(k))‘x(k)]^* (k) 
p-‘  (k)  =  F'  (k  - 1)  +  x(k)x»  (k)  +  x(k)x”  (k)  (2.14) 

In  some  situations  such  as  when  x(k)  is  a  Gaussian 
Stationary  process  (with  each  of  its  components  having  equal 
variance)  ,  it  may  be  shown  that  E[xx^]  is  a  Hermitian  matrix  and 
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therefore,  asymptotically  the  matrix  P“l(k)  in  (2.14)  is  same  as 
that  obtained  by  deleting  the  last  term  in  its  update  (ignoring 
the  factor  of  2)  .  Moreover,  since  the  two  vectors  x  and  x  will 
have  on  the  average  an  small  angle  between  them,  one  may  simplify 
the  algorithm  by  retaining  only  the  x  (k)  term  in  the  first 
update  of  (2.14)  giving  rise  to  the  following  algorithm: 
w(k)  =  w(k-l)  +  P(k)x  (k)  ^*(k) 

p-'(k)=p-'(k-l)+x(k)x”(k)  (2.15) 

When  compared  to  the  RLS  algorithm  (2.12),  both  the  algorithms 

(2.14) ,  (2.15)  have  the  computational  advantage  in  that  the 

update  of  the  matrix  P“^(k)  in  (2.14),  (2.15)  requires  only 

additions  as  compared  to  complex  multiplications  and  additions 
in  the  RLS  algorithm.  Also,  the  first  update  in  (2.15)  also 
requires  order  additions  compared  to  multiplications  in 
RLS.  However,  all  three  algorithms  require  a  matrix  inversion. 
This  problem  can  be  surmounted  by  updating  P“^(k)  for  every  k 
but  updating  P  (k)  only  once  in  every  N  iterations,  say.  This 
will  reduce  the  computational  requirements  for  the  algorithms 

(2.14)  and  (2.15)  to  only  order  N  multiplication  and  order 
additions  per  update  of  the  weight. 

There  are  equivalent  non-recursive  forms  [11-14]  to  the 
RLS  algorithm  and  the  QS  algorithm  (2.15)  which  under 
stationarity  assumptions  approach  the  following  solutions  as  k->ao 

;  R„.E(xx"]  ;  -E[xs*l  (2.16a) 

w®=Rir,.  ;  R,,  =  E[xx")  ;  rj.=E[xs*]  (2.16b) 

With  the  introduction  of  a  quadratic  constraint  and  replacing 
r„  or  fjj,  term  by  the  steering  vector  d,  (2.16a)  reduces  to  (2.2) 
while  (2.16b)  yields 

=  (R,,  +  ffI)'d  (2.17) 

To  obtain  fast  algorithms  whose  computational  requirements  are 
even  smaller  compared  to  those  in  (2.15)  or  (2.17)  above,  the 
following  transformation  in  the  parameter  space  is  considered 
next. 

Transformation  of  Parameter  Space 
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If  one  selects  Y(k)=l  and  B(k)=I  in  the  gradient 
algorithm  (2.13)  which  corresponds  to  the  selection  of  B(m)=l, 
B(k)=0,  k<m  in  the  optimization  index  (2.10)  and  the  Euclidean 
norm  in  (2.9),  the  projection  operation  can  be  illustrated  as  in 

■a 

Figure  2  assuming  the  noise  free  case.  In  the  presence  of  noii’i, 
y  is  less  than  1  and  the  situation  will  be  similar.  As  Tis 
evident  from  the  Figure  2,  the  algorithm  has  a  faster  converge' ice 
rate  if  successive  hyperplanes  are  nearly  orthogonal  compared  to 
the  case  when  they  are  not.  Near  orthogonality  of  hyperplanes , is 
best  achieved  when 

E[x(k)x”(j)]  =  I^k-j)  (2.18) 

Assuming  that  {x(k) }  is  a  nearly  independent  process,  the 
condition  (2.18)  can  be  satisfied  for  a  suitable  transformed 
state  x^^(k)=L  x(k)  for  some  appropriate  linear  operator  L,  for 
example,  L  =  S“^  with  SsH=Rxx  representing  square  root 

factorization  of  the  matrix  Rxx*  With  a  corresponding 
transformation  in  the  parameter  space  w"  =  (L’‘)”w°  one  obtains, 
w"(k)  =  w"(k- 1)+  Kk)x"(k)4*  (k)  (2.19) 

Premultiplying  equation  (2.19)  by  one  obtains  the  following 
fast  version  of  the  gradient  algorithm  (2.13): 

w(k)  =  w(k  - 1)  +  ;^(k)L“L  x(k)^*  (k)  (2.20) 

Similarly  if  one  starts  with  a  quantized  state  version  of 
(2.13)  obtained  by  replacing  x(k)  by  x  (k)  for  computational 
simplification  (which  may  also  be  derived  directly  from  (2.9), 
(2.10)  for  appropriate  selection  of  the  matrices),  its 

corresponding  version  with  accelerated  convergence  is  given  by, 
w(k)  =  w(k-l)  +  P(k)x(k)  ^*(k) 

P*(k)=P‘(k-l)  +  x(k)x“(k)  (2.21) 

The  update  of  P  ' (k)  in  (2.21)  requires  only  n2 
increments /decrements  as  compared  to  complex  multiplications 
or  full  additions  in  algorithms  (2.12)  and  (2.15)  respectively. 
See  also  the  discussions  following  (2.15)  regarding  the  update  of 
w(k). 

Minimization  of  Power  in  Clipped  Interference  'Vector 
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A  doubly  quantized  algorithm  can  also  be  obtained  as  a 
result  of  the  following  minimization  problem: 

Min  E[|x”w^]  =  w“Rw  ;  R  =  [xx“]  (2.22a) 

Subject  to:  d“w  =  l  (2.22b) 

w“w  =  <r^  (2.22c) 

The  solution  to  (2.22)  is  obtained  as  in  terms  of  an  exponential 
weighting  factor  as 

w^(k)  =  R' d 

t  =  /lt(k-l)  +  (l-A)x(k)x»(k)  +  (l-A)£l  (2.23) 

This  minimization  is  meaningful  as  in  the  case  of  MBA  beamforming 
problem,  the  subspaces  spanned  by  {x (k) }  and  {x(k}  sequence  are 
"nearly”  same  and  thus  the  weight  vector  (k)  will  produce  a 
null  in  this  latter  subspace  as  well. 

It  is  straightforward  to  obtain  constrained  recursive 
versions  of  the  algorithms  (2.12)-(2.15) , (2.21) ,  and  (2.23).  The 
algorithm  corresponding  to  (2.12)  is  referred  to  as  the 
constrained  version  of  Newton's  or  conjugate  gradient  method. 
This  algorithm  is  computationally  intensive  requiring  order 
multiplications  per  update  and  as  evidenced  by  some  simulations 
exhibits  poor  stability  due  to  the  projection  operation.  In 
terms  of  various  performance  criteria,  the  constrained  version  of 
(2.21)  with  novel  step  size  selection  )i(k)  provides  the  most 
attractive  algorithm  .  This  novel  algorithm  will  be  referred  to 
as  conjugate  gradient  with  quantized  states  (CQ2)  and  is  given 
below. 

w‘^‘^^(k)  =  Wj  +P^  [w^^(k-l)-;^k)r'(k)x(k)z*(k)] 

T(k)  =  AT(k-l)  +  (1-A)x(k)x"(k)  +  (1-A)fl  ;  T(0)=el  (2.24) 

Another  significant  algorithm  obtained  as  a  constrained 
and  recursive  version  of  (2.23)  and  termed  conjugate  gradient 
algorithm  with  quantized  states  l(CQl)  is, 

=w,+PJw'''5'(k-l)-Mk)r'(k)x(k)z*(k)]  ;  z(k)=x"(k)w'^'(k-l)  (2.25) 


with  the  update  for  T(k)  same  as  in  the  algor ithm(2 . 24)  .  As  is 
apparent  from  the  comparison  of  equations  (2.24)  and  (2.25),  the 
CQl  and  CQ2  algorithms  although  derived  in  quite  different  ways, 
differ  only  in  terms  of  the  prediction  error  z  (k)  in  CQl  (resp. 
z(k)  in  CQ2) .  In  terms  of  the  computational  requirements  there  is 
hardly  any  difference  among  the  two  algorithms.  However,  in  terms 
of  other  performance  criteria,  the  differences  can  be  significant 
and  it  would  be  of  interest  to  evaluate  both  of  them  in  terms  of 
simulation  and  analysis.  It  is  also  clear  that  the  dominant 
computations  involved  in  the  parameter  update  of  CQl  and  CQ2  are 
in  the  product  T'(k)x(k)  requiring  order  additions,  while  the 
update  of  T(k)  requires  only  increments /decrements.  The 

algorithms  do  require  inversion  of  a  matrix.  However,  as 
discussed  with  reference  to  equation  (2.15),  by  updating  T"’^(k) 
(as  against  T(k))  once  in  every  N  iterations,  the  computational 
requirements  can  be  reduced  to  order  N  multiplications  and  order 
n2  additions. 

Quantization  Threshold 

From  the  simulation  studies  presented  in  the  subsequent 
section  it  turns  out  that  the  quantized  state  algorithms  derived 
above  and  with  the  quantization  defined  in  terms  of  the  signum 
function,  have  relatively  poor  convergence.  The  reason  for  this 
poor  convergence  behavior  is  as  follows.  Since  the  observed 
process  x(k)  consists  of  both  the  noise  and  interference  , taking 
sign  of  the  components  of  x(k)  will  result  in  significant  loss  of 
information  about  the  interference  source.  Thus  for  proper 
cancellation  of  the  interference  only  the  interference  component 
of  x(k)  should  influence  the  adaptive  algorithm.  A  simple  method 
to  achieve  this  objective  is  to  place  a  deadzone  (threshold) in 
the  quantization  process,  that  is  to  replace  the  signum  function 
in  the  various  quantized  state  algorithms  by  the  following 
modified  quantization  function, 

(+1  ;  x>th 

quan(x)=  { -1  ;  x<-th 
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[O  ;  otherwise 

for  an  appropriate  value  of  the  threshold  th.  For  example,  one 
can  simply  select  th=3a  where  a  is  the  rms  value  of  the  receiver 
noise.  Note  however  that  it  is  possible  to  optimize  the 
performance  of  the  algorithm  to  a  certain  extent  with  respect  to 
th.  More  generally  at  the  cost  of  some  additional  computations, 
multiple  thresholds  may  also  be  incorporated  in  the  quantization 
process.  Such  optimizations  and  extensions  will  be  studied  in  the 
subsequent  research.  In  this  paper  for  the  sake  of  simplicity, 
the  notation  x  is  used  to  refer  to  this  modified  quantization 
process  as  well. 

3.  Performance  Evaluation 

In  the  following,  the  performance  of  various 
algorithms  is  presented  in  brief  terms.  The  computational 
requirements  of  these  algorithms  have  already  been  discussed  in 
section  2.  For  the  evaluation  of  various  performance  indices,  an 
MBA  antenna  with  7  beams  arranged  in  a  hexagonal  geometry  as 
depicted  in  Figure  3  is  considered.  Each  beam  has  a  3dB  beam 
width  of  2.5^  with  a  3^  separation  among  the  centers  of  adjacent 
beams.  With  this  antenna,  two  jammer  configurations  are 
considered  for  simulations.  In  the  first  configuration,  a  single 
jammer  is  considered  located  anywhere  in  the  area  covered  by  the 
MBA  beam.  The  second  configuration  involves  two  jammers  with 
various  relative  power  levels.  The  condition  number  (CN)  of  the 
update  matrix  was  computed  by  simulations  for  different 
algorithms.  Note  that  as  for  as  the  condition  number  is 
considered,  one  needs  to  consider  only  three  cases  corresponding 
to  the  SMI  algorithm,  QS  algorithm  (2.16b)  and  the  QQ  (and  CQl  or 
CQ2 )  algorithms . 

Figure  4  depicts  the  results  when  one  of  the  jammers  is 
located  at  2  deg  azimuth  and  2.5  deg  elevation  with  a  power  level 
of  28dB  above  the  receiver  noise  level  (corresponding  to  70  dBW 
jammer  power  for  a  certain  satellite  link  geometry) .  The  other 
jammer  located  at  1.5  deg  azimuth  and  2.5  deg  elevation  has 
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varying  power  levels  between  40  to  90  dBW.  The  value  of  e  was 
selected  equal  to  1  for  SMI  and  QS  algorithms  and  equal  to  .01 
for  the  QQ  algorithms  so  as  to  achieve  approximately  0  dB  total 
interference  plus  noise  power  level  (corresponding  to  complete 
interference  cancellation)  at  the  beamformer  output.  As  shown  In 
the  figure  the  CN  for  SMI  algorithm  is  high  and  is  very  sensit.'Ve 

•jj 

to  power  level  difference  among  jammers  (essentially  it  is  equal 
to  the  power  ratios) ,  while  the  CN  for  the  QQ  algorithm  is  very 
low  and  is  least  dependent  upon  the  power  levels .  The  results  of 
the  figure  can  be  approximated  by  the  following  relationships, 

(CN)^  »  (CNC 

(CN)„3  »  ((CNU  (CN)^ 

Since  the  required  arithmetic  precision  and  the  resulting 
numerical  stability  of  the  algorithm  depends  mainly  upon  the 
condition  number  of  the  matrices  involved  in  the  computations, 
the  above  result  is  highly  significant  clearly  delineating  the 
superiority  of  the  proposed  quantized  state  algorithms  with 
respect  to  these  criteria. 

Figure  5  plots  the  convergence  results  for  various 
algorithms  for  the  case  of  single  jammer.  Note  that  both  the  SMI 
and  QQ(th=10)  achieve  complete  nulling  of  the  interference  in 
about  4  iterations  (data  samples)  while  the  QS  algorithm 
requires  about  10  iterations.  The  threshold  for  the  QS  algorithm 
was  set  to  0  to  clearly  show  the  importance  of  the  proposed 
deadzone  in  the  quantization  process.  Actually  the  QQ  algorithm 
with  th=0  did  not  converge  as  is  also  evident  from  the  figure. 
For  the  correlation  feedback  algorithm  the  maximum  value  of  u  is 
.001  from  the  previously  presented  analysis.  For  this  value  of  |i 
the  convergence  rate  is  rapid  but  results  in  about  5  dB  residual 
interference  power  in  steady  state.  A  lower  value  of  u  of  course 
results  in  much  slower  convergence.  In  any  case  in  practice  one 
has  to  use  conservatively  low  value  of  u  to  ensure  convergence. 
Figure  6  shows  the  convergence  of  the  conjugate  gradient 
algorithm  (CG)  for  various  values  of  the  parameter  |i  for  the  case 
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of  single  jammer.  As  is  apparent  from  the  figure  the  CG  algorithm 
exhibits  poor  convergence  behavior  even  for  this  relatively 
simple  scenario.  An  attempt  to  significantly  modify  this 
algorithm  was  not  made  as  the  algorithm  is  not  attractive  in 
terms  of  other  performance  criteria  as  well.  The  convergence 
results  for  the  second  jammer  configuration  are  plotted  in  Figure 
7  wherein  the  second  jammer  has  38  dB  relative  power  level. 
Clearly  while  the  quantized  state  algorithms  have  convergence 
rates  similar  to  that  of  the  SMI  algorithm,  these  are  orders  of 
magnitude  faster  than  the  correlation  feedback  algorithm 
depending  on  the  value  of  |i,  selected  for  the  CF  algorithm.  Higher 
value  of  |i  of  course  results  in  higher  power  of  the  residual 
interference. 

The  steady-state  weight  vector  is  used  to  obtain  the  gain 
pattern  of  the  adapted  MBA  antenna  beam  from  which  a  distribution 
function  of  the  antenna  gain  is  evaluated  both  for  the  SMI  ( 
optimal  in  terms  of  SINK  for  a  signal  source  from  the  specified 
direction  d  )  and  the  proposed  quantized  state  algorithms.  From 
some  simulations  (not  reported  here  due  to  space  limitations) ,  it 
was  observed  that  considering  receivers  having  antenna  gains 
within  10-15  dB  of  the  maximum  gain  of  the  quiescent  pattern,  the 
quantized  state  algorithms  can  provide  much  higher  percentage  of 
the  area  covered  in  the  zone  surrounding  the  jammers.  In  the 
jammer  scenario  considered  above  and  evaluating  the  gain 
distribution  in  the  upper  half  of  the  area  under  the  quiescent 
beam  (0,5. 5) deg  azimuth,  (-5. 5, 5. 5) deg  elevation  ,the  covered 
area  by  QQ/QC  algorithms  was  found  to  be  up  to  60%  higher 
compared  to  the  SMI  algorithm.  However  there  is  significant 
variability  associated  with  this  improvement.  The  improvement  is 
higher  for  higher  power  ratios  of  the  jammers  and  relatively 
higher  link  margins.  More  simulation  results  are  needed  to  gain 
further  analytical  insights  and  obtain  maximum  possible 
improvement  in  this  respect. 
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4.  Conclusions 


It  has  been  demonstrated  in  this  report  that  the 
quantized  state  algorithms  proposed  for  the  suppression  of 
interference  from  an  MBA  antenna  exhibit  convergence  rates 
similar  to  that  of  the  sample  matrix  inversion  (SMI)  algorithm 
and  the  rates  are  orders  of  magnitude  faster  than  the  correlation 
feedback  algorithm.  The  computations  involved  in  the  quantized 
state  algorithms  are  dominated  by  binary  operations  and  thus  the 
algorithms  afford  very  significant  computational  savings  over  the 
SMI  algorithm.  In  fact  with  simple  variations  in  the  update,  the 
computational  requirements  can  be  made  to  be  similar  to  those  of 
correlation  feedback  algorithm  without  any  significant  reduction 
in  the  convergence  rate.  The  matrices  involved  in  the  doubly 
quantized  algorithms  have  their  condition  number  approximately 
equal  to  the  square  root  of  those  in  the  SMI  algorithm  and  thus 
the  proposed  algorithms  are  numerically  robust  when  finite 
precision  arithmetic  is  used  in  their  implementation  using  DSP 
chips.  Due  to  certain  inherent  detection  involved  in  the 
quantized  state  algorithms,  they  also  provide  much  better 
resolution  capability  and  thus  an  increased  area  coverage  in  the 
region  surrounding  the  jammers.  In  a  few  simulation  examples,  the 
improvement  in  terms  of  area  coverage  turned  out  to  be  as  much  as 
60%  over  the  SMI  algorithm.  Due  to  considerable  variability  in 
this  result,  more  simulation  studies  are  required  to  gain  further 
analytical  insights  in  order  to  maximize  such  improvement.  It 
would  also  be  of  considerable  significance  to  evaluate  the 
performance  of  these  algorithms  when  the  adaptation  is  performed 
in  the  presence  of  desired  signal  sources.  It  is  anticipated  that 
due  to  an  implicit  detection  in  the  proposed  algorithms  the 
signal  cancellation  if  any  will  be  much  smaller  compared  to  the 
SMI  algorithm.  Another  very  important  area  of  investigation  is  to 
consider  more  sophisticated  configurations  than  just  a  linear 
combiner.  This  would  be  particularly  interesting  in  view  of  the 
computational  power  released  by  the  quantized  algorithms.  That 
is,  for  a  given  amount  of  computational  power  available  much  more 
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sophisticated  configurations  can  be  implemented  by  using 
quantized  state  algorithms  instead  of  the  SMI  algorithm. 
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Figure  3.  Beam  positions  of  the  multipie-beam  antenna 
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Convergence  of  various  algorithms  (single  jammer  28  dB  above  noise) 
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Figure  6.  Convergence  of  conjugate  gradient  algorithm  with  projection 
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Figure  7.  Convergence  of  various  nonrecursive  algorithms 
(two  jammers  28dB  and  38dB  above  noise) 
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Figure  8.  Convergence  of  various  recursive  algorithms 
(two  jammers  28dB  and  38dB  above  noise) 
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Figure  9.  Convergence  of  correlation  feedback  algorithm 
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Figure  1 1 .  Contour  plot  of  antenna  pattern  ,SMI  algorithm 
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Figure  12.  Contour  plot  of  antenna  pattern,  QC2  algorithm 


5-32 


UPDATING  PROBABILISTIC  DATABASES 


Michael  A.  Pittarelli 
Associate  Professor 

Department  of  Computer  and  Information  Science 


State  University  of  New  York 
Institute  of  Technology  at  Utica/Rome 
P.O.  Box  3050 
Utica.  NY  13504-3050 


Final  Report  for; 

Summer  Faculty  Research  Program 
Rome  Laboratory 


Sponsored  by: 

Air  Force  Office  of  Scientific  Research 
Bolling  Air  Force  Base.  Washington.  D.C. 


August  1993 


6-1 


UPDATING  PROBABILISTIC  DATABASES 


Michael  A.  Pittarelli 
Associate  Professor 

Department  of  Computer  and  Information  Science 
SUNY  Institute  of  Technology  at  Utica/Rome 

Abstract 

A  probabilistic  database  is  a  generalization  of  a  relational  database  that  replaces  the 
characteristic  function  of  a  finite  relation  with  a  finite  real-  or  interval-valued  probability  distribution. 
In  this  report,  methods  are  explored  for  updating  real-valued  probabilistic  databases  in  response  to 
new  information,  reassessment  by  experts  of  certain  probabilities,  or  in  order  to  smooth  a  relative 
frequency  distribution  based  on  limited  observations.  A  distinction  is  made  between  global  updating, 
which  involves  selecting  a  member  of  the  set  of  joint  distributions  each  member  of  which  has  the 
given  database  as  its  projection,  updating  it  appropriately,  then  projecting:  and  local  updating,  in 
which  changes  to  one  distribution  are  propagated  to  the  others,  in  order  to  restore  consistency  of  the 
database.  The  appropriateness  of  Jeffrey's  Rule  of  conditioning  for  propagating  local  changes  in 
probability  is  discussed.  Schemes  used  for  updating  the  probabilities  associated  with  causal,  or 
Bayesian,  networks  are  shown  to  be  applicable  to  probabilistic  databases  whose  schemes  are  a- 
acyclic. 
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UPDATING  PROBABILISTIC  DATABASES 


Michael  A.  Pittarelli 


I.  Introduction  and  Definitions 

Probabilistic  data  models  and  their  potential  for  decision  support  are  discussed  in  [2,6,25,29,30]. 
In  this  report,  methods  for  updating  probabilistic  databases  are  presented. 

Attention  will  be  restricted  to  the  probabilistic  model  introduced  in  [6]  and  elaborated  in  [30]. 
Within  this  model,  a  probabilistic  database  instance  is  a  collection  of  probability  distributions  over 
sets  of  tuples.  Formally,  let  dom(M)  denote  the  finite  domain  of  attribute  M.  The  set  of  attributes  V/ 

on  which  an  element  p/  of  a  database  K  =  {p^^ . p„}  is  defined  is  its  scheme,  denoted  s(p/).  For 

a  set  V  of  attributes,  let  dom(V)  =  x  dom(v).  Then,  for  any  P/^K.  p;:dom(s(p/))— >^[0,1],  where 

if€V 

E,P»  =  1- 

The  probabilistic  database  instance  in  the  tables  below  might  represent  uncertainties  regarding 
conditions  in  Freedonia  on  a  typical  August  day: 


Rain 

No_Phones 

Pi 

No_Phones 

Temperature 

P2 

Temperature 

Late_Trains 

Ps 

yes 

true 

3JS 

true 

high 

1/3 

high 

yes 

1/8 

yes 

false 

1/16 

true 

med 

5/36 

high 

no 

3/8 

no 

true 

1/8 

true 

low 

1/36 

med 

yes 

5/18 

no 

false 

7/16 

false 

high 

1/6 

med 

no 

5/36 

false 

med 

5/18 

low 

yes 

5/72 

false 

low 

1/18 

low 

no 

1/72 

With  this  model,  it  would  be  most  natural  to  maintain  separate  databases  for  each  month.  In 
the  model  presented  in  [2],  a  single  database  consisting  of  individual  probabilistic  relations  could  be 
used  instead.  It  is  shown  in  [25]  how  to  translate  from  either  representation  to  the  other,  in  this 
report,  attention  is  restricted  to  the  simple  model. 

Probabilistic  data  operators  and  their  properties  are  presented  in  [30].  We  will  utilize  only  three 
of  these:  projection,  extension,  and  Join. 

Definition  Let  ACs(p).  Then  )ry)(p)  denotes  the  projection  of  distribution  p  onto  A. 

^Aip){^)  =  E  P(0- 
r£</om(5(p)). 

where,  for  tuples  wGdom(W)  and  b€dom(B).  BCW,  w[B]=b  iff  w  and  b  agree  on  ail  attributes  in 
scheme  B.  □ 

An  arbitrary  but  fixed  ordering  on  the  attributes  is  assumed.  To  emphasize  that  a  tuple  is 
known  to  have  a  particular  value,  the  notation  p(t=x)  will  sometimes  be  used.  The  expression 
p(t[A])  will  be  used  to  abbreviate  ’r4(p)(t[A]).  and  p(t[A]|t[B])  will  abbreviate  p(t[AuB])/p(t[B]). 
Also,  where  TCdom(s(p)).  p(T)  will  denote 

Definition  A  model  of  a  scheme  V  is  a  set  X={Vi . V^}  such  that  VyCV  and  for 
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all  i.jG{l . m}.  (X  will  sometimes  be  referred  to  as  a  model  of  a  distribution  with  scheme  V.)  □ 

If  X  is  the  set  of  schemes  for  the  elements  of  a  database  K,  then  the  scheme,  or  structure,  of  K 
is  X.  Let  s(K)  denote  the  structure  of  K:  s(K)  =  {s(p)  |  peK}:  and  let  dom(s(K))  =  dom(  U  V). 

Unless  stated  otherwise,  the  structure  of  a  database  K  will  always  be  taken  to  be  a  model,  in  the 
above  sense,  of  U  V;.  However,  structures  that  are  not  models  are  sometimes  useful,  for  exam- 

pie,  when  a  projection  ?r4(p/)  for  some  p/€K  is  to  be  accessed  frequently. 

A  distribution  p  with  scheme  V  may  be  projected  onto  a  model  X={Vi . V^}  of  V  to  form 

a  probabilistic  database  K  =  {pj . Pm}.  where  K  =  jrx(p)={’rv^,  (p) . ’rv^^(p)}.  A  database  K 

formed  in  this  way  is  guaranteed  to  exhibit  various  forms  of  consistency;  e.g..  for  any  p/  and  py  in  K. 
=  ’r4(py).  for  all  AC s(p;) ns(py). 

Definition  A  model  X  is  a  refinement  of  model  Y  (and  Y  is  an  aggregate  or  coarsening  of  X). 
denoted  X<Y,  iff  for  each  Vj^gX  there  exists  a  V^gY  such  that  V;,CVy  [5].  □ 

A  database  K  may  be  projected  onto  a  refinement  X  of  s(K)  in  the  obvious  way  to  form  a  data¬ 
base 

^x(K)  =  {7ru,(p)|  V^gX.  V*gs(K).  V^CV*.  pgK.  pgPkJ. 

Definition  Let  Pi,  =  {p|p:dom(V)->[0,l].  X1,P(0  =  !}•  The  extension  of  a  distribution  p  with 
scheme  A  to  the  scheme  V.  ACV.  is  the  set  of  all  preimages  p'  with  scheme  V  of  p  under  the  map¬ 
ping  xa: 

E^'(p)  =  {p'eP„|,r^(p')  =  p}.  □ 

The  extension  of  a  database  K  is  the  intersection  of  the  extensions  of  its  elements: 
E^(K)  =  npg>fE''(p).  Thus.  E'^(jrx’(p))  is  the  set  of  all  preimages  of  the  database  irx(p)  under  the 
mapping  xx',  eny  model  X  of  V  partitions  Pj,  into  classes  E*^(?rx(p))  equivalent  with  respect  to  pro¬ 
jections  onto  X.  Any  E’^(p)  or  E*^(K)  is  a  convex  polyhedron,  the  set  of  solutions  to  the  system  of 
linear  equations  determined  by  the  projection  conditions.  (If  s(K)  is  a  cover  of  V.  then  E*^(K)  may 
be  abbreviated  E(K).) 

Definition  The  join,  over  a  set  of  variables  V.  of  a  database  K  =  {pi . Pm}  is  the  element 

j'^(K)  of  E'^(K)  with /nax/rnt/m  e/ifropy:  j''(K)  =  arg  max  — p(t)xln(p(t)).  □ 

V  /  \  /  peE^'iK)  ' 

(Normally.  s(K)  is  a  cover  of  V.  In  this  case,  or  when  the  context  makes  clear  what  set  of  variables 
is  intended.  J'^(K)  is  abbreviated  J(K).) 

Since  E’^(K)  is  convex.  J^(K)  is  unique  [14].  For  a  set  Q  with  unique  maximum  entropy  ele¬ 
ment,  let  J(Q)  denote  that  element.  Determination  of  J(Q)  for  arbitrary  constraints  defining  Q  is 
usually  a  difficult  optimization  problem.  However,  this  is  not  the  case  when  the  constraints  are  in  the 
form  of  a  probabilistic  database. 

Definition  The  pairwise  Join  of  two  probability  distributions  is  calculated  as 

PJ(Pi.P2)(t)  =  Pi(x)xp2(y)/  S  PzW- 

z|z[s(p,)n5(p2)]=r[s(pi)ns|p2)] 

where  x=t[s(pi)]  and  y=t[s(p2)].  (The  denominator  of  the  above  expression  equals  1  if 

s{Pi)ns(p2)  =  0.)  □ 
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Theorem  1  [22]  PJ(pi.P2)GE({pi.p2})  iff  PJ(pi.P2)  =  J({Pi.P2})-  □ 

It  follows  immediately  that  P  =  J(’r{i/i,i/'2}(p))  iff  (VinV2)  and  V2— (VinV2)  are  conditionally 
independent,  given  VinV2.  Note  that  PJ  is  not  commutative. 

When  s(K)  is  a-acyclic  [7].  J(K)  can  be  calculated  as  a  sequence  of  applications  of  pairwise  join 
[30],  s(K)  is  a-acyclic  iff  the  output  Z  of  the  algorithm  below  {Graham's  algorithm  [7])  is  {0}. 

1.  Z  :=  s(K) 

2.  Repeat  in  any  order  until  neither  has  any  effect  on  the  current  value  of  Z: 

a.  If  V  appears  in  only  one  element  of  Z.  remove  v  from  that  element. 

b.  If  V,  c  Vy  and  {V,.Vy}  C  Z.  then  Z  :=  Z-{V,}. 

If  s(K)  is  not  a-acyclic  (i.e.,  is  a-cyclic).  an  iterative  proportional  fitting  algorithm  that  converges  to 
J(K)  must  be  used  instead  [5]. 


II.  Updating  Databases 


In  the  literature  on  knowledge  bases,  a  distinction  is  made  between  updating  and  revision 
[14.15].  A  knowledge  base  is  updated  when  it  is  changed  in  response  to  a  change  in  the  state  of 
affairs  it  is  intended  to  represent.  It  is  revised  when  it  is  altered  to  represent  more  faithfully  a  fixed 
state  of  affairs.  We  will  not  make  this  distinction:  both  types  of  changes  will  be  referred  to  here  as 
updates. 

In  general,  an  update  of  a  database  instance  K  is  any  replacement  of  K  with  an  instance  K'.  We 
will  consider  a  variety  of  types  of  replacement.  For  example.  K'  =  K— {p}  and  K'  =  Ku{p}  qualify 
as  updates.  Thus,  it  is  not  required  that  s(K')  =  s(K). 

One  requirement  that  will  normally  be  imposed  is  that  the  result  K'  of  an  update  of  K  be  con¬ 
sistent.  if  K  itself  is.  A  database  K  is  consistent  iff  E(K)  =?«=  0.  A  necessary  condition  for  consistency 
is  local  consistency:  K  is  locally  consistent  iff  for  all  p;.  py  €  K, 

’’■i(p/)ns(py)(Pi)  =  ’'■s(/>,)ns(py)(py)- 

Local  consistency  is  sufficient  for  consistency  if  s(K)  is  a-acyclic.  Consistency  corresponds  to  the 
existence  of  a  universal  instance  for  a  relational  database. 

Consistent  updating  consists  of  two  phases  (the  second  of  which  may  be  null): 

(i)  K  is  replaced  by 

(ii)  K*"”''  is  replaced  by  a  consistent  K',  subject  to  some  additional  constraints. 

Step  (ii)  is  deliberately  vague.  For  example,  suppose  =  (K— S)uS'.  It  might  be  reasonable  to 
require  that 


K'  =  arg  min  J'  «f(p*.p). 


where  d  is  a  suitable  measure  of  dissimilarity  of  probability  distributions,  and  C  is  the  class  of  con¬ 
sistent  K*  such  that  s(K*)  =  s(K^®'”'’).  In  other  words,  the  difference  between  the  updated  database 
and  the  original  should  be  minimized,  subject  to  the  constraint  that  S  is  replaced  by  S'. 

To  illustrate  when  one  might  reasonably  insist  that  a  p€K‘“”^  be  contained  in  K'.  consider  an 
update  to  the  database  above  containing  probabilities  for  conditions  on  a  typical  August  day  in 
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Freedonia.  Suppose  that  we  have  become  concerned  about  the  likelihood  that  sharks  are  present 
near  the  shore,  and  have  had  an  expert  assess  joint  probabilities  for  temperature  and  shark  activity: 


Temperature 

ShallowSharks 

Pi 

high 

yes 

1/8 

high 

no 

3/8 

med 

yes 

1/12 

med 

no 

1/3 

low 

yes 

1/120 

low 

no 

3/40 

With  K  =  {P1.P2.P3}.  K'"”'’  =  {P1.P2.P3.P4}.  S  =  {p«}  and  the  minimum  distance  requirement. 
K'  =  K**'”''.  since  and  any  reasonable  distance  measure  is  nonnegative  (d(p.q)>0)  and 


nondegenerate  (d(p.q)=0  iff  p=q). 

Another  type  of  "wholesale'  replacement  of  a  p€K  with  pVp  occurs  in  databases  of  relative 
frequencies  obtained  by  sampling.  A  new  observation  will  change  all  the  entries  x  of  an  affected  dis¬ 


tribution  Pi  to  either 


Nx 
N  +  1 


or 


Nx  +  i 
N+1 


where  N  is  the  number  of  observations  that  have  already 


affected  p/.  (p/  is  affected  by  observation  t  if  t[s(p;)]  is  defined.) 


III.  Updating  Distributions 

There  are  less  radical  forms  of  replacement  of  probability  distributions;  for  example,  those  aris¬ 
ing  from  constraints  of  the  following  types; 

(/)  t[A]=w.  where  ACs(p) 

(//)  p{t[A]=w)=x 
(///)  p(t[A]=w|t[Bl=u)=x 

Constraint  type  (/)  is  the  special  case  of  (//)  in  which  x=l.  To  illustrate,  suppose  that  we  meas¬ 
ure  the  temperature  on  a  typical-seeming  August  day  in  Freedonia  and  find  that  it  is  in  the  high 
range;  where  t  is  the  tuple  representing  this  day.  we  know  t[Temperature]=high.  but  none  of  the 
other  components  of  t.  What  effect  does  this  information  have  on  P2?  Minimally.  P2  must  be  changed 
so  that  3r{7-em/,«rafor«}(p'2)(high)=l.  but  there  are  infinitely  many  p  defined  on  s(p2)  with  this  pro¬ 
perty.  With  the  additional  constraint  that  p'2(x|Temperature=high)  =  p2(x|Temperature=high).  p'2 
is  uniquely  determined  and  may  be  calculated  by  Bayesian  conditionalization  (which,  although  it  is 
standard  practice,  is  controversial  [1]]; 

p'2(No_Phones=true.  Temperature=high)  =  p2(No_Phones=true.  Temperature^high  |  Temperature=high) 


1/3 

1/3  +  1/6 


=  2/3.  etc. 


Case  {il)  when  0<x<l  is  more  controversial.  If  a  distribution  p  has  Just  been  replaced  with  p'.  it 
seems  reasonable,  prima  facie,  to  adopt  jt^vCpOCw).  for  any  w€dom(W).  WCs(p').  If  this  policy  is 
followed,  it  is  necessary,  after  replacing  p2  with  p'2.  to  replace  pi  with  a  p'l  such  that 

p'i(No_Phones=true)  =  p'2(No_Phones=true)  =  2/3. 

Suppose  that  No_Phones=true  means  that,  at  some  point  during  the  day.  phone  service  will  be 
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interrupted  for  at  least  an  hour,  and  that  disruptions  in  service  may  be  caused  by  electrical  storms, 
which  are  likelier  on  hotter  days.  Intuitively,  then,  it  is  reasonable  that  our  probability  for 
No_Phones=true  should  increase  when  we  learn  that  Temperature=high.  But  the  constraint 
p'i(No_Phones=true)=2/3  is  not  sufficient  to  identify  p'^  uniquely.  With  only  this  constraint, 
values  of  p'l  are  free  to  range  within  the  intervals  i  in  the  table  below. 

Adding  the  information  that  the  probabilities  for  No_Phones  originated  in  the  observation 
Temperature=high  and  requiring  p'i(t)  =  p(t|Temperature=high).  for  some  p€E({pi.p2.P3})  nar¬ 
rows  the  intervals  considerably,  to  i'. 

The  stronger  constraint  p'i(Rain=x|No_Phones=y)  =  pi(Rain=x|l\lo_Phones=y).  combined 
with  p'i(No_Phones=true)=2/3  is  sufficient  to  identify  p'l  uniquely,  as 

p'i(Rain=x.No_Phones=y)  =  p'i(Rain=x|l\lo_Phones=y)xp'i(No_Phones=y). 


Rain 

No_Phones 

i 

i' 

P'l 

yes 

true 

[0,2/3] 

15/12.2/3] 

1/2 

yes 

false 

10.1/31 

[0.1/8] 

1/24 

no 

true 

[0.2/3] 

[0.1/4] 

1/6 

no 

false 

[0.1/3] 

[5/24.1/3] 

7/24 

Updating  pi  to  p'l  in  this  way  is  known  as  Jeffrey  conditionalization  [13].  According  to  what  is 
referred  to  as  Jeffrey's  Rule  for  updating  probability,  if  conditional  probabilities  p(A|B/)  are  unaffected 

by  a  change  in  probabilities  for  the  elements  of  a  partition  {B^ . B„}  of  the  space  on  which  p  is 

defined,  then,  in  light  of  these  changes,  the  probability  of  event  A  should  be  updated  to 

P'(A)  =  S,.P'(A|B;)xp'(B;)=  S,.p(A|B,)xp'(B/). 

The  properties  of  Jeffrey  conditionalization  have  been  studied  by  a  number  of  authors;  the  fol¬ 
lowing  theorem  is  a  special  case  of  results  discussed  presented  in  [19.20.23,35]. 

Theorem  2  Let  p'  denote  the  Jeffrey  update  of  p  on  the  basis  of  a  shift  in  marginal  probabilities 
from  7r;,(p)  to  7r^(p)'.  for  ACs{p):  p'(t)  =  p(t[s(p)-A]|t[A])x7r;,(p)'(t[A]). 

Let  C  =  E*^'’l(jr^(p)').  Let  d  denote  directed  divergence  (relative  entropy,  relative  information): 

'•(qp)  =  (qW/p(t))- 

Then  p'  =  arg  minp»gc<l(p*.p).  □ 

Of  course.  Jeffrey  conditionalization  of  a  distribution  p  needn’t  take  place  in  response  to  an 
update  to  a  q?^p.  Suppose  that  we  think  we  have  just  felt  a  drop  of  rain  and  that  Rain=yes  means 
the  day's  official  rainfall  total,  measured  at  Freedonia  International  Airport,  is  at  least  1/8".  We  may 
wish  to  revise  our  probability  of  rain  from  7r/j3;„(pi)(yes)  =  7/16  to  7r/;3i,(pi)'(yes)  =  9/10.  in  which 
case,  by  Jeffrey’s  Rule. 


p”i(Rain=yes.No_Phones=true)  =  etc. 

To  illustrate  case  (///),  suppose  that  the  railroad  has  replaced  all  locomotive  engines  with  new 
ones  that  are  less  likely  to  break  down  on  hot  days.  Suppose  the  new  conditional  probabilities 
p(Late_Trains=x|Temperature=y)  are: 
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yes 

no 

high 

1/2 

1/2 

tried 

2/3 

1/3 

low 

5/6 

1/6 

Since  we  have  no  information  on  the  effect  of  the  new  trains  on  Temperature,  it  seems  reasonable  to 
maintain,  on  grounds  of  "insufficient  reason'  to  do  otherwise.  ^{Temperatureyip'z)  =  ^{TempentureyiPs)- 
With  this  constraint,  p'3  is  uniquely  determined,  as; 


Temperature 

Late_Trains 

p's 

high 

yes 

1/4 

high 

no 

1/^ 

med 

yes 

5/18 

med 

no 

5/36 

low 

yes 

5/72 

low 

no 

1/72 

This  is  a  distance-minimi2ing  update  as  well,  although  it  is  not  a  Jeffrey  update.  Van  Frassen 
[33,34]  discusses  problems  with  and  alternatives  to  minimum-distance  updating.  One  alternative, 
applicable  in  case  (///),  is  to  reduce  (and  only  reduce)  joint  probability  values  so  they  are  consistent 
with  the  new  conditional  values,  and  then  normalize  them.  Van  Frassen  found  that  this  update  rule 
significantly  outperformed  minimum  (relative  information)  distance  updating  and  a  third  method  on  a 
test,  using  simulated  data,  of  the  rate  at  which  convergence  to  an  "objectively  correct'  probability 
distribution  is  achieved.  Applied  to  the  example,  it  yields: 


Temperature 

LateTrains 

P"3 

high 

yes 

7/32 

high 

no 

7/32 

med 

yes 

5/16 

med 

no 

5/32 

low 

yes 

5/64 

low 

no 

1/64 

Notice  that  niTemper^tureyip's)  *  {Tfsnptr>turey(Pz) ■  Although  it  is  more  conservative  to  assume 

equality,  it  may  be.  for  all  we  know,  that  the 

new  trains 

are  more  efficient  and  thus  generate  less 

heat.  Furthermore,  requiring  that  marginal 

probabilities 

remain  unaffected  (in  this  case,  that 

p(Temperature=high)  remain  1/2)  is  incompatible  with  a 

policy  of  updating  via  standard  Bayesian 

conditionalization  whenever  it  is  applicable.  (See  [33].  p. 

379.)  Suppose  that  hyper-reliable  diesel 

locomotives  that  thrive  on  heat  have 

been  acquired  and  that 

p(Late_Trains=yes|Temperature=high)  =  1. 

Then  p(Late_Trains=no|Temperature=high)  =  0. 

which  implies  p(dom(s(p3))  — {<high.no>})  = 

1.  Bayesian  conditionalization  produces  the  distribu- 

tion  p  3,  and  ^ [Temperature}{p  z)  ^{TewperatartyiPz)- 

Temperature 

LateTrains 

P"'3 

high 

yes 

9/45 

high 

no 

0 

med 

yes 

20/45 

med 

no 

10/45 

low 

yes 

5/45 

low 

no 

1/45 
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Much  of  this  controversy  can  be  avoided  if  we  work  with  only  the  constraints  we  do  accept. 
There  is  widespread  agreement  that  the  only  reason  to  be  interested  in  probabilities  (as  degrees  of 
belief  versus,  e.g.,  relative  intensities  in  image  processing)  is  to  use  them  to  make  decisions.  A  strict 
Bayesian  will  want  to  choose  an  action  that  maximizes  expected  utility  relative  to  a  single,  known, 
real-valued  probability  distribution  over  the  events  relevant  to  the  decision.  However,  there  are 
methods  that  can  be  applied  to  decision  making  when  beliefs  are  represented  by  intervals  or  sets  of 
real-valued  probability  distributions  that  reduce  to  expected  utility  maximization  as  a  special  case 
[17.28]. 

Suppose  we  are  trying  to  decide  if  we  should  bicycle  to  the  Freedonia  lighthouse  this  evening.  If 
rain  develops  we  will  want  to  call  home  for  a  ride.  Also,  whether  we  go  or  not,  we  need  to  contact 
the  neighborhood  bookie,  by  phone  or  in  person,  between  6:00  and  7:00  p.m.  Utilities  for  the  two 
actions  and  immediately  relevant  states  are: 


Rain 

No_Phones 

Go 

Don't 

yes 

true 

0 

0.8 

yes 

false 

0.2 

0.6 

no 

true 

0.1 

0.2 

no 

false 

1.0 

0.1 

We  are  certain  that  the  temperature  is  high  and  we  have  access  to  database  K  =  {Pi-Pa-Ps}-  Rela¬ 
tive  to  distribution  p'l.  above.  "Don't"  has  higher  expected  utility:  39/80  versus  19/60.  But  p'l  was 
calculated  under  an  assumption  one  might  wish  to  reject,  viz.,  that  knowledge  of  Temperature  does 
not  affect  the  conditional  probabilities  for  Rain,  given  knowledge  of  No_Phones. 

The  assumption  is  unnecessary.  For  an  interval  distribution  i. 

R(i)  =  {peP5(,)|p(t)€i{t).  for  all  tedom(s(i))}. 

R(i)  is  the  set  of  solutions  of  a  system  of  linear  inequalities.  It  can  be  determined  via  linear  program¬ 
ming  whether  or  not  there  exists  a  p€R(i)  relative  to  which  an  action  a  maximizes  expected  utility  (in 
which  case  a  is  said  to  be  E-admissible  [21],  or  a  Bayes  action). 

For  the  bicycling  decision  problem,  relative  to  R(i').  "Don’t"  is  a  Bayes  action  and  "Go"  is  not. 
Two  linear  programs  suffice  to  determine  that  "Don't"  maximizes  expected  utility  for  all  pGR(i'), 
including  p'^.  which  it  is  therefore  unnecessary  to  calculate. 


iV.  Restoring  Consistency 

We  now  address  in  greater  detail  the  problem  of  restoring  consistency  to  a  database  when  one 
or  more  of  its  elements  is  updated.  Recall  that  the  general  form  of  a  database  update  is 

K  ^  K'. 


Suppose  that 

Kfemp  ^  (K-S)US'. 

In  our  running  example.  K  =  {pi.pa-Ps}-  Temperature  is  observed  to  have  the  value  "high",  and  p2 


and  P3  are  updated  via  Bayesian  conditionalization: 
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»_Phones 

Temperature 

P't 

Temperature 

Late_Trains 

m 

true 

high 

2/3 

high 

yes 

1/4 

true 

med 

0 

high 

no 

3/4 

true 

low 

0 

med 

yes 

0 

false 

high 

1/3 

med 

no 

0 

false 

med 

0 

low 

yes 

0 

false 

low 

0 

low 

no 

0 

_  (K-{p^.p^})u{p'2.p'3}.  K'"”'’  is  inconsistent.  s(K'“"'’)  is  a-acyclic,  as  can  be  determined 
by  application  of  Graham's  algorithm.  Since  s{K**'”'’)  is  a-acyclic,  it  can  be  made  consistent  by  elim¬ 
inating  all  local  inconsistencies.  There  is  only  one  local  inconsistency; 
t^{No  p/i(««s}(Pi)  ^  "^{No  p/ion*s}(p'2)-  Because  the  update  originated  in  the  measurement  of  a  variable 
in  s(p'2).  it  seems  rational  to  leave  p'2  fixed  and  replace  pi  with  a  p'l  that  is  consistent  with  p'2.  As 
discussed  above,  there  are  infinitely  many  peE*^***^  such  that  ^{No_Phones}{p)  =  ^{No_Phones}{p'2)- 

Recall  that  Jeffrey  conditionalization  on  the  updated  probabilities  ^{fjoPhonesyip'z)'  which 
requires  the  assumption  p'i(Rain=x|No_Phones=y)  =  pi(Rain=x|l\Io_Phones=y),  identifies  p'l 
uniquely.  Let  us  define  the  distribution  resulting  from  Jeffrey  conditionalization  of  q  in  response  to  an 
update  of  p  to  p'  as  follows; 

Definition  Let  x  =  t[s(q)  — (s(q)ns(p'))]  and  y  =  t[(s(q)ns(p')].  Then  JC(p'.q)GPj(,)  is  defined  as 

-IC(p'.q)(t)  =  q(x|y)xp(y).  □ 

In  our  example,  p'l  =  JC(p'2.Pi).  and  the  database  {p'l-p'a-P’a}  's  consistent. 

It  follows  from  Theorem  2  that; 

Corollary  3  Let  C  =  (7ri(p-)nj(,)(p'))'  Then,  where  d  is  directed  divergence. 

JC(p'.q)  =  arg  min  d(q*.q).  □ 
q*ec 

It  follows  from  the  definitions  of  Jeffrey  conditionalization  and  pairwise  join  that 
Theorem  4  JC(p'.q)  =  ;rs(,)(PJ(p',q)).  □ 

Corollary  5  s(p')ns(q)  =  0  implies  JC(p'.q>)  =  q.  □ 

Notice  that  the  Jeffrey  update  of  q  by  p',  the  projection  onto  s(q)  of  the  join  of  q  and  p',  is 
analogous  to  the  semijoin  of  one  relation  by  another.  The  semijoin  of  a  relation  r2  by  a  relation  r^  is 
SJ(ri.r2)  =  ’r5j^jj{ri*r2).  where  'tt'  denotes  relational  projection  and  denotes  natural  join  [32]. 

A  mapping  from  a  probability  distribution  to  a  relation  with  the  same  scheme  that  possesses  a 
number  of  desirable  properties  [6.30]  is;  tp;.(p)(t)  =  ceil(p(t)).  Let  Rj[;.|  denote  the  set  of  all  non¬ 
empty  relations  over  dom(s(r));  Rj^^j  =  —  {0}.  It  follows  from  results  in  [6]  that  tp^  is  a 

homomorphism  from  (Ps(p)  xP^j,).  Psic;)-  JC)  to  (Rs(p)  ><^5(17)-  RjI?)-  SJ). 

Theorem  6  tp;.(JC(p.q))  =  SJ(tp;.(p).tp^(q)).  □ 

Additional  forms  of  replacement  may  be  identified.  It  is  common  practice  [3]  to  adjust  a  relative 
frequency  distribution  p  by  replacing  it  with  p'  =  J{jrx(p)).  where  X  is  a  suitable  model  of  s(p).  The 
primary  motivation  for  such  adjustment  is  the  elimination  of  sampling  zeros  (relative  frequency 
values  of  zero  that  are  due  merely  to  the  ratio  between  sample  size  and  |dom(s(p))|).  However, 
experimental  studies  [16.27]  have  shown  that,  for  suitable  X.  J(jrx(p))  tends  to  be  closer  than  the 
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relative  frequency  distribution  p  is  to  the  probability  distribution  generating  the  observations  from 
which  p  is  estimated.  Identifying  an  optimal  model  X  is  difficult,  and  involves  a  tradeoff  between  the 
smoothing  achieved  by  increasing  the  refinement  of  the  model  and  the  loss  of  information  incurred  in 
doing  so;  if  X<Y.  then,  where  d  is  relative  information  and  u^i^)  is  the  uniform  distribution  over 
dom(s(p)): 

Theorem  7  d(J(jrx(p)).U5(p))  <  d(J(5ry(p)).u^(p)).  □ 

Theorem  8  d(p.J(7rx(p)))  >  d(p.J(7ry'(p))).  □ 

An  additional  consideration,  when  a  relative  frequency  distribution  p  to  be  adjusted  is  an  element 
of  a  probabilistic  database  K.  is  the  additional  effort  required  to  resolve  inconsistencies  in 
^temp  _  (K— {p})u{J(jr;f(p))}.  Ways  of  minimizing  this  effort  are  suggested  by  the  following 
results. 

Lemma  9  ACB  implies  5ry((;rfl(p))  =  7ry((p).  □ 

Theorem  10  Let  p'  =  J(;r;f(p)).  For  all  WeX  and  VCW.  irvi'?')  =  ’'"vIp)- 

Proof:  By  the  definition  of  Join.  p'€E(jr;f(p)).  Then,  by  the  definition  of  extension.  5rx(p')  =  ’r;f(p). 
By  the  definition  of  projection  onto  a  structure.  7riv(p')  =  7riy(p).  for  all  WgX.  If  VCW. 
’rv(’rw(p'))  =  ’rv(’rw(p))-  By  Lemma  9.  ffv/(p')  =  jrv^(p).  □ 

Let  Y  =  {s(p)ns(q)|qeK.  q^^p}.  It  follows  from  Theorem  10  that  if  K  is  consistent  and  s{K)  is 
a-acyclic.  K'®'”'’  is  guaranteed  to  be  consistent  if  X  is  a  coarsening  of  the  reduction  of  Y  [7],  i.e.,  of 
Y  -  {V|V€Y,  V'€Y.  VcV'}.  For  example,  with  K  =  {qi,q2.q3}.  s(qi)  =  {w.x.y.z}.  s(q2)={x.y,u}, 
and  s(q3)={y.v}.  where  q^  is  to  be  adjusted,  if  the  search  is  restricted  to 
{{x.y}}  <  X  <  {{w.x.y.z}},  then  (K— {qi})u{J(jrx(qi))}  will  be  consistent.  Even  with  this  restric¬ 
tion.  there  remain  eight  models  (other  than  {w.x.y.z}.  which  will  yield  q'i=qi)  to  choose  from. 

This  form  of  adjustment  is  applicable  only  to  p  such  that  |s(p)|  >  1.  For  such  p.  let 

X  =  {W|Wcs(p),  |W|  =  |s(p)|-l}. 

Then,  if  K  is  consistent  and  s(K)  is  a  model  and  is  a-acyclic.  it  follows  from  Theorem  10  that 

(K-{p})u{J(7r;,(p))} 

is  consistent. 

This  adjustment  can  be  performed  in  parallel  on  any  number  of  database  elements.  From  the 
table  below,  each  of  the  four  databases  {ps.  pg}.  {p'j,  pg}.  {ps.  p'g}.  and  {p'5.  p'g}  is  consistent, 
since  the  projections  onto  {V2}  of  the  four  distributions  are  the  same. 


Wi 

V2 

Ps 

P'5=J(’r{{r,}.{^}}(P5)) 

V2 

Vj 

Pi 

P'l=-l("'{{,,4}.{n)}(Pl)) 

0 

0 

1/4 

3/16 

0 

0 

1/8 

3/32 

0 

1 

1/2 

9/16 

0 

1 

1/8 

5/32 

1 

0 

0 

1/16 

1 

0 

1/4 

9/32 

1 

1 

1/4 

3/16 

1 

1 

1/2 

15/32 

So.  some  degree  of  smoothing  can  always  be  achieved,  without  the  need  to  form  scheme  inter¬ 
sections  and  search  a  (sub)lattice  of  models.  Note,  though,  that  when  |s(p)|  >  2,  X  as  defined  above 
is  a-cyclic  and  iterative  proportional  fitting  is  required  to  calculate  J(x;(^(p)). 

Not  all  smoothing  procedures  in  use  preserve  projections  in  this  way.  Convex  smoothing 
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replaces  p  with  Ap  +  (l-A)uj(p),  where  Ag[0.  1],  The  value  of  A  may  be  selected  to  minimize  the 
expected  squared-error  distance  from  the  underlying  unknown  distribution  generating  the  observed 
frequencies  [9J.  There  is  a  strong  analogy  between  selecting  a  Ag[0.  1]  and  selecting  a  model 
{0}<X<{s(p)}:  when  A=0  and  X={0},  or  when  A=1  and  X={s(p)},  Ap  -f 

(l-A)uj(p)  =  J(x-x(p)):  also,  corresponding  to  Theorem  7, 

A'<A  implies  d((A'p  -1-  (l-A')us(p)).  u^jp))  <  d{(Ap  +  {l-A)uj(p)).  Uj(p)). 

But  unlike  model-fitting,  convex  smoothing  is  not  guaranteed  to  preserve  projections  even  onto  single 
variables.  Suppose  p''5  =  l/2xp5  -f  l/2xUi[pj).  from  the  table  above.  Then 

^{.,}(p''5)(0)=3/8^1/4=;r{i.^}(p5)(0). 

Instead  of  replacing  p  by  J(jr;f(p)),  for  some  X,  p  may  be  replaced  by  the  set  ^xip)  itself.  There 
are  two  scenarios  in  which  this  might  be  reasonable. 

First,  it  may  be  determined  that  p  is  reconstructable  from  [5]  (resp.,  approximately  reconstruct- 
able  from)  X:  p  =  J(x-x(p))  (resp..  d(p.J(jrx(p)))  <  e).  It  is  possible  to  achieve  significant  data 
compression  by  such  replacement.  Suppose  that  X  =  {v|vGs(p)},  |s(p)|  =  n,  and  |dom(v)|  =  k  for 
all  v€s(p).  Then  k"  —  l  numbers  are  required  to  represent  p.  versus  n[k—l)  for  xx^(p). 

Second,  one  may  decide  that  one  has  insufficient  confidence  in  the  assessment  or  estimation  of 
some  pGK.  If  p  is  a  relative  frequency  distribution  based  on  N  observations.  /V/|dom(V)|  will  be 
larger  than  /\//ldom(s(p))|  for  each  V€X<{s(p)}.  The  estimates  ffi/(p)  will  be  more  reliable  than  the 
estimate  p.  Or.  an  expert  may  have  been  called  upon  to  assess  probabilities  over  s(p):  actions  taken 
based  on  this  assessment  turn  out  badly:  when  pressed  on  the  matter,  the  expert  admits  that  she  is 
knowledgeable  regarding  only  the  events  in  certain  sets  dom(V).  where  Vcs(p). 

It  follows  from  Theorem  11  below  that  in  each  of  these  cases,  if  K  is  consistent  then  so  is 
(K— {p})Uxx(p).  regardless  of  the  cyclicity  of  s(K),  since,  if  X  is  a  model  of  s(p). 
(s(K)-{s(p)})uX  <  s(K). 

Theorem  11  [6]  X<Y  implies  E(xj^(p))  C  E(7rx(p)).  □ 

Conversely,  K*CK  may  be  replaced  with  J(K*).  One  may  wish  to  estimate  probabilities  for 
events  in  dom(V*),  where  there  is  no  Vgs(K)  such  that  V*CV,  but  Since 

7rs(/(»)(J(K*))  =  K*.  no  information  is  lost  by  replacing  K  with  (K— K*)u{J(K*)}.  But  for  purposes 
of  decision  making,  there  is  no  need  to  construct  such  an  estimate  unless  one  is  a  strict  Bayesian 
[17.28].  Also,  unless  s(K)  is  /^acyclic  (each  submodel  of  K  is  a-acyclic  [7]).  consistency  of  K  does 
not  guarantee  consistency  of  (K— K*)u{J(K*)}. 

V.  Propagating  Changes  in  Probability 

We  now  turn  to  the  problem  of  updating  an  entire  database  K  following  receipt  of  information  in 
one  of  the  forms  (i).  (//).  or  (///)  discussed  in  Section  III. 

How  we  should  proceed  seems  to  depend  on  our  attitude  toward  K.  In  the  literature  on  the 
management  of  uncertainty,  the  predominant  view  is  that  a  single  joint  probability  distribution  over 
dom(s(K))  would  be  obtained,  stored,  and  updated  as  appropriate,  if  it  were  possible  to  do  so 
[11.18,24],  There  are  at  least  two  reasons  why  it  may  not  be  possible. 
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First,  as  discussed  above,  maintaining  a  joint  distribution  p  will  require  the  determination  and 
storage  of  exponentially  more  numbers  than  a  collection  ^x(p)  marginal  distributions,  if  X  is 
sufficiently  refined. 

Second,  there  are  many  situations  in  which  probabilities  can  or  should  be  assessed  only  over 
subsets  of  the  set  of  variables  over  which  one  would  like  to  have  a  joint  distribution.  To  illustrate, 
imagine  a  study  in  which  probabilities  of  occupation  of  the  elements  of  a  3-dimensional  grid  are  to  be 
estimated  for  an  object  that  can  detect  other  objects  only  in  the  {X,  Y}  plane.  Therefore,  in  order 
not  to  disrupt  the  object's  usual  pattern  of  movement,  frequencies  are  observed  and  recorded  for  the 
{Y.  Z}  and  {X.  Z}  planes  only.  In  general,  there  will  be  infinitely  many  distributions  over  {X.Y.Z} 
having  the  estimated  distributions  as  marginals;  if  no  additional  information  is  available  (e.g..  location 
along  the  X  coordinate  is  probabilistically  independent  of  location  along  Y,  given  the  location  along  Z) 
then  it  seems  that  there  is  no  good  reason  to  single  out  any  one  of  them. 

Now.  let  us  distinguish  between  local  and  global  database  updates.  In  both  cases,  the  entire 
database  K  is  updated  to  K'.  But  in  a  local  update,  changes  are  made  to  one  or  more  elements  of  K 
and  these  changes  are  propagated  to  the  other  elements.  In  a  global  update,  some  element  p  of  E(K) 
is  selected  and  updated  appropriately  to  p';  K  is  then  updated  as  K'  =  ’r;(K)(p')- 

A  global  update  is  always  computable,  if  K  is  consistent,  regardless  of  the  cyclicity  of  s(K).  as 
follows: 

1.  Form  J(K) 

2.  Update  J(K)  to  J(K)' 

3.  Form  K'  =  7r,(K)(J(K)'). 

But  the  reasons  that  would  lead  one  to  maintain  a  database  K  of  distributions,  versus  a  single 
distribution  whose  projection  onto  s(K)  equals  K.  are  reasons  for  rejecting  this  global  update  scheme; 
computation  of  J(K).  and  thus  7r5|/(|(J(K)').  may  be  infeasible;  K  may  have  the  structure  s(K) 
merely  because  of  constraints  limiting  the  sets  of  attributes  over  which  it  is  possible  to  obtain  joint 
probabilities;  and  the  dependency  relations  from  which  it  may  be  deduced  that  the  actual  distribution 
over  dom(s(K))  is  J(K)  are  either  not  assumed  to  hold  or  are  explicitly  rejected. 

Local  updating  schemes  are  discussed  in  the  literature  on  causal,  or  Bayesian,  networks 
[11.18.24].  These  networks  are  directed  acyclic  graphs  each  node  of  which  represents  a  single  vari¬ 
able.  and  whose  arcs  represent  dependency  relations  between  variables.  Associated  with  each  variable 
V  is  a  matrix  containing  probabilities 

p(v=x|t[V]=w) 

for  each  xGdom(v)  and  w€dom(V),  where  V  is  the  set  of  variables  from  which  there  is  an  arc  to  v. 
In  combination  with  the  dependency  information  encoded  in  the  arcs,  these  matrices  determine  a  sin¬ 
gle  joint  distribution  over  all  the  variables  in  the  network.  Despite  this,  the  schemes  in  use  for  updat¬ 
ing  networks  are  strictly  local. 

An  early  algorithm  applicable  to  databases  whose  schemes  do  not  reduce  to  graphs  is  presented 
by  Lemmer  [20].  Although  the  algorithm  has  the  advantage  that  it  does  not  need  to  utilize  assump¬ 
tions  of  probabilistic  independence,  the  r-graph  of  the  database  must  be  a  forest.  The  r-graph  of 
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K  =  {pi . Pn}  is  an  undirected  graph  with  node  set  {s(pi) . s(p„)}  and  edge  set 

{<V,.Vy>|V,-nVy  ^  0;  [V*nV,  0.  V*nVy#0.  V/OVyCV*]  ^  V;t^s(K)}. 
where  V/,  Vy,  and  VJ^  are  distinct  sets  of  variables,  of  s(K).  To  update  K  in  response  to  an  update  of 
PGK  to  p': 

1.  If  s(p')  is  not  the  root  of  the  tree  containing  it.  then  mark  s(p')  as  updated  and  update  its  parent, 
i.e..  the  distribution  q  associated  with  its  parent  s(q)  as  q'  =  JC(p',q).  and  mark  s{q')  as  updated. 
Continue  until  the  root  is  updated. 

2.  Update  each  unmarked  child  c  of  root  r'  as  c'  =  JC(r'.c).  and  mark  s(c'). 

Repeat,  recursively,  for  the  subtree  rooted  at  s(c').  for  each  updated  child  c'. 

The  resulting  database  is  consistent,  if  K  is  consistent. 

Although  developed  for  multiply-connected  causal  networks  (where  more  than  one  path  exists 
between  at  least  one  pair  of  nodes),  there  exist  local  propagation  methods  applicable  to  databases 
whose  r-graphs  are  not  forests,  but  whose  schemes  are  a-acyclic.  These  are  discussed  in  detail  in 
[18], 

For  databases  whose  schemes  are  not  oi-acyclic.  local  updating  seems  to  be  out  of  the  question. 
Consider  the  consistent,  but  a-cyclic.  database 


Vi 

V2 

Pt 

V3 

Pu 

Vl 

Vj 

Pii 

0 

0 

3/8 

0 

0 

3/8 

0 

0 

1/8 

0 

1 

0 

0 

1 

3/8 

0 

1 

1/+ 

1 

0 

3/8 

1 

0 

1/8 

1 

0 

3/8 

1 

1 

1/4 

1 

1 

1/8 

1 

1 

l/< 

Suppose  the  information  vi=l  is  received.  Distributions  pg  and  pu  are  updated  independently  by 
Bayesian  conditionalization: 

Vi  V:  P\ _ Vj  P'li 

0  0  0  0  0  0 

0  10  0  10 

1  0  3/5  1  0  3/5 

112/5  11  2/5 

Depending  on  whether  the  (Jeffrey)  update  of  pio  is  by  p'g  or  by  p'u.  a  different  result  is  obtained; 

V2  V3  JC(p',.Pi,)  JC(p^),Pio) 

0  0  3/10  9/20 

0  1  3/10  3/10 

1  0  2/10  3/20 

1  1  2/10  1/10 

Further,  as  can  be  determined  by  linear  programming.  E({p9.pio.pii})  has  a  single  element;  it  and  its 
update  on  the  information  vi=l  are; 
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Vl  V2  V3  P  P' 

0  0  0  1/8  0 

0  0  11/40 

0  10  0  0 
0  110  0 

1  0  0  1/4  2/5 

10  11/8  1/5 

110  1/8  1/5 

1111/8  1/5 

As  is  easily  verified,  the  projection  of  p'  onto  {V2.V3}  is  distinct  from  both  JC(p'9.pio)  and 
JC(p'ii.Pio)! 

VI.  A  Problem  for  Enemies  of  the  M.E.P. 

There  has  been  much  debate  regarding  whether,  when  information  regarding  joint  probabilities 
does  not  determine  those  probabilities  uniquely,  the  ambiguity  should  be  resolved  by  invoking  the 
maximum  entropy  principle.  According  to  Jaynes,  its  leading  proponent: 

The  MAXENT  principle,  stated  most  briefly,  is:  when  we  make  inferences  based  on  incomplete  information,  we  should 
draw  them  from  that  probability  distribution  that  has  the  maximum  entropy  permitted  by  the  information  we  do  have 
[12.  p.  940], 

For  a  consistent  database  K.  the  convexity  of  E(K)  guarantees  the  existence  of  a  unique  pGE{K) 
with  maximum  entropy.  The  arguments  for  and  against  maximizing  entropy  are  well-known  [12.31]. 
The  present  author  has  sided  with  the  dissenters  in  this  debate,  and  has  advocated  methods  of  infer¬ 
ence  and  decision  making  with  various  types  of  constraints  on  underdetermined  probabilities  that  do 
not  utilize  any  particular  representative  of  the  set  of  joint  probability  distributions  satisfying  the  con¬ 
straints  [17.28].  However,  if  a  database  K  is  updated  locally  using  Bayesian  methods,  the  result  is 
indistinguishable  from  a  global  update  utilizing  the  maximum  entropy  element.  J(K).  of  E(K). 
Further,  it  is  not  the  case  that  for  all  p.q€E(K).  Jrj(K)(p’)  =  ’'■s(x)(q')- 

To  illustrate,  recall  the  example  of  the  object  whose  movements  could  be  observed  only  in  two 
planes.  Suppose  that  the  true  probabilities  of  location  in  three  dimensions  are: 

X  Y  Z  p 

000  1/16 
0  0  1  0 

0101/8 
oil  3/16 
1  0  0  5/16 

10  11/8 
1101/8 
1  1  1  1/16 

The  2-dimensional  projections  of  p.  i.e..  the  data  actually  obtained,  are: 


Y 

z 

Pt2 

X 

z 

Pl3 

0 

0 

3/8 

0 

0 

3/16 

0 

1 

1/8 

0 

1 

3/16 

1 

0 

1/4 

1 

0 

7/16 

1 

1 

1/4 

1 

1 

3/16 
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It  is  determined  that  the  probability  of  Y=0  should  be  3/4,  not  1/2.  The  Jeffrey  update  of  pi2.  p'i2. 
and  the  Jeffrey  update  of  p^a  by  p'12  are: 

Y  Z  X  Z  JC(p',;.p„) 

0  0  9/16  0  0  33/160 

0  1  3/16  0  1  5/32 

1  0  1/8  1  0  77/160 

111/8  11  5/32 

The  Jeffrey  updates,  on  the  information  that  the  probability  of  Y=0  is  3/4,  of  p  and 
J(’r{{K.z}.{xz}}(p))-  are: 


X 

Y 

z 

p' 

J({Pl2Pu}) 

0 

0 

0 

3/32 

27/160 

0 

0 

1 

0 

3/32 

0 

1 

0 

1/16 

3/80 

0 

1 

1 

3/32 

1/16 

1 

0 

0 

15/32 

63/160 

1 

0 

1 

3/16 

3/32 

1 

1 

0 

1/16 

7/80 

1 

1 

1 

1/32 

1/16 

Observe  that  rr{{K2}.{X2}}(J{{Pi2.Pi3})')  =  {p'u  P'ls}.  but  Jr{{K.z}.{xz}}(p')  ^  {p'lz  P'is}.  even 
though  we  have  postulated  that  {P12.P13}  =  ’'■{{k.z}.{xz}}(p)-  More  generally. 

Theorem  12  Where  q'  is  a  Jeffrey  update  of  q  originating  in  some  VCs(qi)Cs(q). 
for  arbitrary  distributions  qi  and  q2.  □ 

Remark:  Jeffrey  updating  reduces  to  Bayesian  conditioning  as  a  special  case. 

This  result  poses  a  serious  philosophical  problem  for  those  maintaining  a  database  K  of  probabil¬ 
ity  distributions  who  wish  both  to  reject  the  assumptions  that  would  pick  out  the  maximum  entropy 
element  of  E(K)  as  the  Joint  distribution  of  which  K  is  a  projection  and  to  update  the  database  in 
response  to  new  information.  Local  Bayesian  updating  of  K  results  in  the  same  distribution  that 
results  from  forming  the  maximum  entropy  extension  of  K,  updating  the  extension,  and  projecting 
the  update  onto  s(K). 

VII.  Conclusion  and  Open  Problems 

Probabilistic  databases  may  be  updated  to  accommodate  new  information,  where  the  new  infor¬ 
mation  can  take  various  forms.  At  one  extreme,  entire  distributions  may  be  added  or  removed:  at  the 
other,  the  probability  of  an  event  in  the  domain  of  a  single  distribution  is  revised.  Databases  may 
also  be  adjusted  to  smooth  the  effects  of  sample  size  limitations,  for  probabilities  estimated  from  fre¬ 
quencies.  In  each  of  these  cases,  local  changes  must  be  propagated  throughout  the  database  in  order 
to  restore  consistency. 

The  method  proposed  for  modifying  one  distribution  in  response  to  changes  in  another  is  based 
on  Jeffrey's  Rule  [13].  It  is  noted  that  Jeffrey’s  Rule  requires  the  assumption  of  constancy  in  certain 
conditional  probabilities:  without  this  assumption,  the  update  is  underdetermined,  in  general.  It  is  also 
noted  that  the  use  of  Jeffrey’s  Rule  to  propagate  changes  in  probability  is  equivalent  to  projecting 


6-16 


onto  the  scheme  of  a  database  the  update  of  its  join.  This  is  troubling  for  those  who  are  philosophi¬ 
cally  uncomfortable  with  the  maximum  entropy  principle. 

The  distinction  in  the  literature  between  updating  and  revising  a  knowledge  base  is  mentioned. 
The  distinction  seems  less  critical  for  probabilistic  databases  than  it  does  for  logical  databases  [14]. 

A  distinction  is  made  between  local  and  global  updates.  Local  updating  (e.g..  by  Jeffrey's  Rule) 
appears  to  be  impossible  for  databases  whose  schemes  are  a-cyclic  [7].  For  such  a  database,  updat¬ 
ing  by  selecting  an  element  of  its  extension,  updating  it,  then  marginalizing,  is  possible  but  computa¬ 
tionally  infeasible. 

Methods  for  updating  interval-valued  probabilistic  databases,  in  which  a  closed  subinterval  of 
[0.  1]  is  associated  with  each  tuple,  remain  to  be  developed.  It  seems  that  these  should  reduce  to 
Bayesian  methods  in  the  special  case  in  which  all  intervals  are  degenerate.  Extension.  Join  and  pro¬ 
jection  operators  for  such  databases  are  defined  in  [25,26].  The  work  of  Fertig  and  Breese  on  interval 
influence  diagrams  [8]  may  be  relevant. 

Another  open  problem  is  to  identify  update  anomalies  for  probabilistic  databases  analogous  to 
those  for  relational  databases.  Probabilistic  analogues  of  relational  data  dependencies  are  defined  in 
[6].  There  may  exist  hierarchies  of  normal  forms  defined  in  terms  of  probabilistic  functional,  join,  and 
multivalued  dependencies  whose  members  possess  varying  degrees  of  resistance  to  whatever  mean¬ 
ingful  anomalies  are  discovered. 

A  third  area  for  future  research  is  the  development  of  ways  to  coordinate  updating  with  planning 
and  decision  making.  Not  all  portions  of  a  database  will  be  relevant  to  all  problems.  Identifying 
relevant  subschemes  is  discussed  in  [29].  Anytime  [4.10]  methods  of  decision  making  with  proba¬ 
bilistic  databases,  in  which  the  number  of  admissible  actions  decreases  with  computing  time,  should 
also  be  explored. 
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Abstract 


In  this  paper,  the  solution  for  a  near-field  polarizing 
layer  facing  an  infinite  array  of  antenna  elements  printed  on 
protruding  dielectric  substrates  is  described.  The  analysis  of 
the  polarizer  is  accomplished  by  using  Floquet  expansion  modes 
for  expressing  the  field  components,  and  the  method  of  moments  to 
enforce  the  conditions  on  the  printed  currents.  The  analysis  and 
code  are  capable  of  representing  fairly  general  single-layered 
polarizer  geometries  (planar  in  x  and  y)  with  currents  printed  in 
both  the  X  and  y  directions,  and  with  a  periodicity  identical  to 
that  of  the  array.  Numerical  values  for  the  element  active 
impedance  and  axial  ratio  are  presented  for  a  design  with 
circular  polarization  characteristics  positioned  on  top  of  arrays 
of  dipoles  with  straight  and  bent  arms  operating  in  the  transmit 
mode.  Reflection  coefficients  from  arrays  of  dipoles  and 
microstrip  patches  illuminated  by  a  plane  wave  are  also 
calculated. 
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ANALYSIS  OF  A  GENERALIZED  POLARIZER  LAYER 
FOR  INFINITE  ARRAYS  OF  ANTENNAS 
ON  PROTRUDING  DIELECTRIC  SUBSTRATES 


iJean- Pierre  R.  Bayard 


Introduction 

Infinite  arrays  of  antennas  printed  on  dielectric  walls 
protruding  from  a  ground  plane  have  been  the  focal  point  of  much 
research  effort  related  to  MMIC  and  Aerospace  applications  [1-7] . 
These  antenna  elements,  whether  they  are  dipoles  as  in  [1-7]  or 
potentially  tapered  slots,  exhibit  because  of  the  array  geometry 
(radiating  currents  perpendicular  to  array  face)  linear 
polarization  characteristics.  In  this  paper,  a  single  dielectric 
sheet  of  printed  gratings  is  added  in  front  of  the  array ^  (see 
Figure  1)  offering  the  possibility  of  altering  its  polarization 
characteristics .  This  alteration  may  take  the  form  of  a 
circularly-polarized  (CP)  wave,  comparable  to  the  performance  of 
the  meander-line  polarizer  of  [8-10] ,  or  a  purely  linearly- 
polarized  (LP)  wave  removing  the  undesirable  cross  polarization. 
The  analysis  /  code  for  describing  the  polarizer  is  the  focus  of 
the  effort,  and  shall  be  considered  a  module  additive  to  the  work 
described  in  [3-7] . 

In  the  section  that  follows,  the  analysis  of  the  polarizing 
layer  is  formulated  by  using  the  equivalence  principle,  modal 
expansion  techniques,  and  the  method  of  moments.  The  application 
of  the  equivalence  principle  at  the  top  and  bottom  of  the  layer 
follows  step  identical  to  those  described  in  [6],  thereby 
facilitating  the  integration  of  this  module  with  the  previously 
developed  code.  Since  the  structure  is  planar  and  doubly 
periodic  in  nature,  the  fields  from  the  equivalent  sources,  as 
well  as  the  Green's  function  for  the  printed  x  and  y-directed 
electric  currents,  are  expressed  using  typical  Floquet  functions. 
The  method  of  moments  procedure  is  then  used  for  enforcing  the 
electromagnetic  boundary  conditions  on  the  electric  currents  of 
the  polarizer.  The  reader  should  note  that  the  method  can  not 
presently  model  a  CP-producing  meander-line  polarizer .  In  order 
to  achieve  circular  polarization  from  such  polarizer,  the 
gratings  ought  to  be  printed  oblique  to  the  incoming  wave,  i.e., 
the  y-directed  wave  emanating  from  the  antenna  of  Figure  1,  and 
designed  such  that  the  wave  components  that  are  perpendicular  and 
parallel  to  the  meander-line  axis  are  phased  90  degrees  apart. 
While  the  present  work  can  not  treat  polarizer  currents  printed 
on  a  tapered  domain,  as  would  be  required  by  the  meander-line 
design,  it  offers  a  moment-method  based  approach  which,  in 
conjunction  with  [6],  pemits  the  modeling  of  a  wide  variety  of 
polarizer  designs  (with  currents  defined  on  rectangular  domains) 
when  facing  the  array. 

Lastly,  numerical  results  are  presented  for  infinite  arrays 
of  microstrip-fed  dipoles  with  straight  and  bent  arms.  The 
physical  dimensions  of  the  polarizer  and  its  substrate's 
thickness  and  permittivity  are  adjusted  for  achieving  circular 
polarization  near  broadside.  The  active  impedance  and  axial 
ratio  curves  are  plotted  versus  scan  and  frequency.  Plane  wave 
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reflections  from  dipole  and  microstrip  arrays  are  also  considered 
as  limiting  test  cases  for  verification  purposes. 


FORMULATION 

The  geometry  of  the  polarizer  is  shown  in  Figure  2a.  The 
bottom  dielectric  layer  is  a  substrate  with  a  relative 
permittivity  6p,  thickness  tp,  upon  which  x  and/or  y-directed 
electric  currents  can  be  printed.  The  top  layer  or  supers trate 
with  a  permittivity  and  thickness  t^  can  serve  as  a  protective 
radome  cover  or  its  permittivity  may  be  set  to  that  of  free- 
space.  The  goal  of  the  formulation  is  to  determine  matrix 
equations  expressing  the  self  and  mutual  impedances  of  the 
electric  currents  printed  a  z=Zp+tp,  as  well  as  the  coupling 
between  these  currents  and  sources  located  outside  the  polarizer, 
i.e.,  z<Zp  and  z>Zp+tp+td.  Applying  the  equivalence  principle  at 
z=Zp  and  z=Zp+tp+td,  we  set  up  a  model  for  the  regions  exterior  to 
the  polarizer  but  included  in  the  unit  cell  (see  Figure  2b) .  For 
the  purpose  of  this  work,  the  part  of  the  model  represented  by 
the  equivalent  magnetic  sources  at  z=Zp-6  and  z=Zp+tp+td+6  (6 
indicating  a  limiting  quantity)  is  assumed  known  from  [6] ,  thus 
will  not  be  included  in  the  analysis.  In  order  to  characterize 
the  region  of  the  model  which  lies  inside  the  polarizer,  the 
electric  current  problem  and  the  magnetic  current  problem  are 
formulated  separately  using  the  TM^  and  TE^  mode  sets.  For  the 
electric  currents  inside  the  parallel-plate  guide  of  Figure  2b, 
the  first  step  is  to  determine  the  Green's  functions  for 
infinitesimal  x  or  y-directed  currents  printed  at  (Xo,yo,  Zp+tp)  . 

Let 


where 


ki=^epkl-u\-vl, 

Uj  =iCgS  in0co  s(j) + 

= JCgS  in9  s  incj) + 


1271 

a 

n27c 

b 


g-icr,u-^o)cosicf(z-Zp) 


(1) 


0,  4)  are  the  beam  steering  angles,  ko  is  the  free-space  wave 
number,  and  a  and  b  are  the  unit  cell  dimensions  in  the  x  and  y 
directions,  be  the  TM^  and  TE^  mode  functions  for  the  polarizer 
substrate  (Zp  ^  z  <  Zp+tp)  .  Similarly,  let 

Et-  e  cos*/  ( Z-Zj- 1„-  tj) 

where 

k^=sleJrl-ui-Vn, 


be  the  TM^  and  TE^  functions  for  the  superstrate.  The 
determination  of  the  Green's  functions,  i.e.,  the  solution  for 
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(b)  Equivalent  model  with  magnetic  sources 


Figure  2.  The  schematic  and  equivalent  model  for  the  polarizing  layer. 
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the  modal  coefficients  ,  is  accomplished  by- 

enforcing  the  continuity  of  the  electric  field  components 
tangential  to  the  interface  at  z=Zp+tp,  as  well  as  the  magnetic 
field  jump  discontinuity  caused  by  the  delta  current  sources. 
The  result  is  given  by  the  following  matrix  equations : 


-jV’aSin(Jc/t„) 

- — - — a^kisinik^t  ) 

-jVpSin(*/tp) 

n 

— ^  vyc/sin  ( Jc/t,,) 
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— La  ic/cos  (Jc/tj) 

J  VpCOS  (/c/tp) 

—  U^kfcos  (kfcj 

G)|i  ^  ^ 

where  G3  =l/(ab)  for  x-directed  currents,  64=1/ (ab)  for  y- 
directed  currents,  and  0  otherwise.  From  equations  (1-3),  any 
field  component  produced  by  an  x  or  y-directed  current  can  be 
expressed  straightf o2n/jardly  by  convolving  the  current 
distribution  with  the  appropriate  Green's  function  component. 

For  the  equivalent  magnetic  sources  inside  the  polarizer, 
let  the  TM^  and  TE^  mode  functions  be 


(4) 


for  the  region  defined  by  Zp  ^  z  ^  Zp+tp,  and 
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(5) 


for  Zp  +tp^  z  ^  Zp+tp+t^.  Enforcing  the  continuity  of  the 
tangential  electric  and  magnetic  field  components  at  z=Zp+tp 
gives 
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1 

2 

1  y 

reducing  equations  (4)  and  (5)  to  having  only  four  modal 
coefficients,  Ap„i,  ,  which  will  be  among  the  unknowns 

to  be  determined  in  the  subsequent  moment  method  formulation. 
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Method  of  Moments 

At  this  time,  the  formulation  has  accounted  for  most  of  the 
electromagnetic  boundary  conditions  governing  the  fields  produced 
by  the  electric  and  equivalent  magnetic  sources  inside  the 
polarizer.  The  remaining  conditions  are  those  existing  on  the 
domains  of  the  sources,  i.e.,  at  z=Zp,  and  on  the 

polarizer  conductor  printed  at  z=Zp+tp.  Consider  a  current 
distribution  of  the  form 

J^{x,y)--xJ^{x,y)  +yJ^{x,y) 


existing  on  a  perfectly-conducting,  zero- thickness  geometry 
printed  at  z=Zp+tp.  With  the  aid  of  the  moment  method,  the 
distribution  of  (7)  can  be  determined  by  means  of  a  finite  sum 
approximation.  Let 


=En=l 


sink^{hn-\x-Xn\) 

WnSink^hn 

sink^{h^-\y-y^\) 

W^sinkji^ 


(8) 


where  x^,  yn  are  the  coordinates  of  the  mode  center,  W^  is  the 
mode  width,  h„  is  the  mode  half-length,  a.nd  a^,  b„  are  the  unknown 
coefficients  defining  the  function.  As  in  [6],  adjusting^ 
individual  mode  dimensions  and  location  permits  the  modeling  of  a 
wide  variety  of  polarizer  geometries  describable  in  the 
rectangular  coordinate  system.  With  respect  to  the  current 
distributions  at  the  two  apertures  at  z=Zp  and  z=Zp+tp+td,  by 
expressing  their  fields  as  modal  sums,  we  have  in  effect  expanded 
these  currents  in  terms  of  Floquet  basis  functions  with  unknown 
coefficients  A^n^,  and  D^ni*  Also,  when  implementing  the 

formulation  numerically,  the  magnetic  current  summations  are 
truncated  to  user-specified  integer  values,  say  -NM^  n  ^NM  and 
PP  ^1  ^PP.  As  in  the  Galerkin's  procedure,  the  solution  for 
these  6  unknowns  is  achieved  by  testing:  a)  The  x  or  y-directed 
electric  field  at  z=Zp+tp  with  x  or  y-directed  electric  current 
modes,  and  b)  the  x  or  y-directed  electric  and  magnetic  fields  at 
z=z  and  z=Zp+tp+td  with  the  conjugate  of  the  Floquet  functions 
across  the  apertures.  For  the  conditions  at  z=Zp+tp,  we  have: 
x-Directed  electric  field  ecaiation 


[ZP^]  [aj  +  [ZP^]  [bj  +  [ZP^^] 


[A/il 

[Q^i] 

[Dn^l] 


=0 
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where 


7-8 


•7PXX_ 

^mq 


wi  th 


T2{q)=- 


sin  {kitp)  Tj^iq)  (m)  ( g)  Tz  im) 

_2_gjv„y5sin 

/ 

\  "  2  J 

cosiu^hg)  -cos  (kghg) 

WqSinkghg 

kl-ul 

(10) 


NNP  and  NLP  are  user-specified  summation  limits  for  the  parallel- 
plate  Green's  functions,  m,q=  1,2,  ...,  NPX,  and  the  star 
denoting  the  complex  conjugate  of  the  quantity. 
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COS  iVJi^)  -cos  ikji^) 

W^sinkji^ 

kl-vl 

(11) 


and  q=l ,2,  , NPY, 


—  ( Uj^ki)  Tl  {m)  t;  {m)  , 

JC06^6p 

with  -NM-<n:<NM,  -PP:<1:<PP,  and  q={l+PP)  (2NM+1) +n+NM+l , 

im)  t;  [m)  , 

JweoS 

with  g={2m+l)  {2PP+1)  +{1+PP)  (2m+l)  +n+NM+l,  (12) 

ijV^)  T*  iw)  t;  im)  , 

with  g=2{2NM+l)  {2PP+1)  +{1+PP}  {2m+l) +n+m+l , 
and 

zSq^=  ijvj  [m)  t;  [m)  , 

with  g=3  (2i\M+l)  (2PP+1)  +{i+PP)  {2Mf+l)  +n+NM+l. 

Realize  that  the  modal  coefficients  in  equations  (10)  and  (11) 
represent  the  now  known  Green's  function  coefficients  for  the  x- 
directed  source  [eq.  (10)]  and  the  y-directed  source  [eq.  (11)]. 
Also,  the  index  q  in  equation  (12)  indicates  the  ordering  of  the 
unknowns  used  in  the  magnetic  current  problem. 

v-Directed  electric  field  equation 
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[aj  +  [zpyy]  [bj  +  [zpy^] 


=0 


(13) 


[Cn^l] 

[Dn^l] 


where 


•rpyx- 

^mq 


Emp  ^-knli 

n=-mp  2^1=- 


(xiejE.p^ri=-im> . 


■NLP 


sin  {kitp)  (g)  r/  (/n)  (g)  Tl  (m)  , 

wi  th  m=l  ,2,  ... ,  NPY  and  g=l ,  2 ,  .  .  . ,  NPX, 


(14) 


r^pyv- 

^mq 


sinikitj  TAg)  r/  (m)  r,  (g)  r;  (m)  , 


wi  th  g=l  ,2,  ... ,  NPY, 


(15) 


Z^q"  =  -^7^  (  ^3*  (^)  T*,  {m)  , 

with  -NM^n:iNM,  -PPiiiPP,  and  q=  {1+PP)  {2NM+1) +n+NM+l, 

[m)  r;  {m)  , 

with  q={2NM+l)  {2PP+1) +{1+PP)  {2NM+1) +n+NM+l,  (16) 

=  -  ^3*  ' 

with  q=2{2NM+l)  (2PP+1)  +  (i+PP)  (2iW+l) +n+Mr+l , 
and 

ZSP=-  {jUj)  {w)  t;  {m)  , 

with  q=3{2NM+l)  (2PP+1) +(i+PP)  {2NM+1)  +n+NM+l . 

For  the  conditions  at  z=Zp  and  z=Zp+tp+t(j,  the  goal  is  to  simply 
express  in  matrix  form  the  fields  produced  by  the  sources  inside 
the  polarizer  at  these  apertures.  Since  the  x  and  y-directed 
electromagnetic  fields  are  tested  with  exp(  jV^y)  exp  ( jUiX)  over 
O^xi^a  and  O^y^b,  the  resulting  matrices  are  highly  sparse, 
particularly  those  coming  from  the  equivalent  magnetic  sources . 
The  matrix  forms  of  the  tangential  field  components  at  z=Zp+6  are 
given  by: 

x-Directed  electric  field 
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[Cn^l] 

Ml 


where 


Jwe^Gp 

with  -NM:<n:<NM,  -PP:<1:<PP,  and  m=q={l+PP)  {2NM+1) +n+NM+l , 

{U,ki)ah, 

(18) 

with  q=[2m*\)  {2PP»1)  +(i+PP)  (2m+l}  +n+i®f+l, 

with  q=2{2NM+l)  {2PP+1)  +  {1+PP)  {2m+l) +n+NM+l , 

Z^^^=jV^ah, 

with  q=2{2NM+l)  (2PP+1)  +  (i+PP)  {2m+l)+n+m+l. 
v-Directed  electric  field 


[2-Lyaa] 


U£1 

IB/il 

lc£l 

IdSi\ 


where 


25"'=— r-i—  (V'A®)  e"*''"ab, 

jcoe^ep 

with  -NM:in<NM,  -PP:<1<PP,  and  m=q=  {1+PP)  {2NM+1) +n+NM+l , 

. {V^k^)ah, 

JWEoS  (20) 

with  q-{2llM*l)  (2PP+1) +(2tPP)  (2iSM+l) +n+J®f+l, 

zS“‘-jU,e^'’’‘-ab, 

with  q=2{2m+l)  (2PP+1) +(i+PP)  (2iVPf+l) +n+Mf+l, 

Z^^^^=-jU,ab, 

with  q=3{2m+l)  (2PP+1) +(i+PP)  (2iVM+l) +rz+Mf+l. 


x-Directed  magnetic  field 
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[Y^^]  [aj  [Y^^y]  [bJ+[Y^^^] 


(21) 


[Bn^l] 

[C/i] 

[Dn^l] 


where 


T,  (g)  r,  (g)  ab, 

JCu^ 

with  -NM:<n<NM,  -PP:<.1<PP,  m=  {1+PP)  {2NM+1) +n+NM+l , 
and  q=l,2,  .  .  .NPX, 


?■«'’'=  ( -3V„E,^r^jV,kiF&)  T,  (g)  T.  (g)  ab, 

with  -NM:in:iNM,  -PPr^liPP,  m=  { 1+PP)  {2NM+1) +n+NM+l , 
and  q=l,2,  .  .  .NPY, 


(23) 


with  -NM-^n-^NM,  -PPii:iPP, 


and  ni=g={l+PP)  {2NM+1) +n+NM+l , 


Y^^^=-jV,ab, 

with  q={2NM+l)  (2PP+1)  +  (i+PP)  {2NM+1)  +n+NM+l , 


y-Lxaa, 

^mq 


jon 


‘‘ab, 


with  q=2{2NM+l)  (2PP+1) +{i+PP)  (22W+1) +rz+Mf+l, 


with  q=M^NM+l)  (2PP+1) +(i+PP)  (2i\/M+l) +n+Mf+l, 


(24) 


v-Directed  magnetic  field 


[A^i] 


[jrLyax]  [jrLyay]  +  [  j^Lyaa] 


[B/i] 

[C/J 


(25) 


where 


(9)  ^2  (9) 

with  -m:in:iNM,  -PP^1<PP,  m=  {1+PP)  {2m+l) +n+NM+l , 
and  q=l  ,2 ,  .  .  .NPX, 
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1 


Y^y^y 

^mq 

wi  th 
and 


=  {ju.ESi-^jVj^iFSi)  T,  (g)  T,  ig)  ah, 

j  0)  u 

-NM<n:iNM,  -PP:<l:iPP,  (l+PP)  {2NM+1) +n+m+l , 
g=l,2,  .  .  .NPY, 


(27) 


with  -NM:in:iNM,  -PPril^PP,  and  in=g=  (1+PP)  {2NM+1) +n+NM+l , 
Y^^^^=jU,ab, 

with  g={2NM+l)  {2PP+1)  +  (l+PP)  (2m+l) +n+NM+l , 


Y^yaa  _  _  1 

^mg 


(28) 


JQJl 

with  g=2  {2NM+1)  (2PP+1)  +(1+PP)  {2m+l)  +n+m+l, 

,Lyaa_  1  tt-  j,P. 


^mq 


j(on 


V^k^ab, 


with  g=3  {2NM+1)  (2PP+1)  +{1+PP)  {2m+l)  +n+m+l. 


The  mathematical  representations  of  the  electromagnetic 
conditions  at  z=Zp+tp+td-6  are  very  much  similar  to  those  in 
equations  (17-28) ,  but  using  the  wave  functions  in  equations  (2) 
and  (5) ,  and  the  relationships  given  in  equation  (6) .  For  that 
reason,  they  are  not  shown  here. 

EXAMPLES 

The  computer  code  developed  from  the  method  described  above 
is  applied  to  a  microstrip  array,  to  an  array  of  dipoles  printed 
on  a  grounded  dielectric  slab,  and  to  a  polarizer  design  facing 
arrays  of  dipoles  on  protruding  dielectric  substrates.  Aside 
from  these  results,  the  following  canonical  problems  were 
successfully  calculated  (but  not  shown  here  due  to  space 
constraint) :  1)  Plane  wave  reflection  from  a  perfectly- 

conducting  surface  with  and  without  a  dielectric  substrate;  2) 
the  element  impedance  of  an  infinite  array  of  center- fed  and  y- 
directed  dipoles  inside  a  free-space  parallel-plate  guide. 

Dioole  gratings 

In  the  first  example,  the  plane  wave  reflection  from  an 
infinite  array  of  short-circuit  dipoles  printed  on  a  grounded 
dielectric  slab  is  considered.  The  dipoles  are  0.10  cm  wide  and 
printed  on  a  90  mil-Noryl  substrate  (6p=2.7)  with  a  1.3  cm  square 
grid.  This  geometry  can  be  modeled  by  simply  setting  Zp  =  0  in 
the  present  module  and  d=0  in  the  work  done  in  [6]  .  Figure  3a 
shows  the  E-plane  phase  of  the  reflection  coefficient  calculated 
versus  dipole  length  at  a  frequency  of  12  GHz,  for  normal 
incidence  and  with  the  reference  plane  at  the  dielectric  -  free- 
space  interface.  Observe  that  as  the  dipole  length  gets  smaller, 
the  phase  value  approaches  99.6  degrees  which  is  the  phase  value 
of  a  grounded  dielectric  slab  alone.  This  example  serves  as  an 
illustration  of  the  versatility  of  such  module  whereby  plane  wave 
reflection  problems  from  planar  geometries  with  or  without  a 
superstrata,  and  with  a  one  (d=0  case  in  [6]  )  or  two-layer  (d^^O 
and  t-*  a  in  [6]  )  substrate  can  easily  be  modeled. 

Microstrio  gratings 
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(a)  Dipole  gratings 


(b)  Microstrip  gratings 


Figure  3  . 


Reflection  coefficient  phase  for  arrays  of  dipoles 
and  microstrip  patches  printed  on  a  grounded 
dielectric  slab. 


In  the  second  example  shown  in  Figure  3b,  the  array  of  open- 
circuit  microstrip  patches  analyzed  and  measured  in  [11]  is 
considered.  The  square  patches  are  1.2  in  wide,  printed  on  a 
tp=0.062  in,  6p=2.47  substrate.  The  unit  cell  dimensions  are 
a=1.8  in  and  b=2.87  in,  matching  those  of  the  waveguide  simulator 
used  in  [11] .  Unable  to  determine  the  reference  phase  value  used 
for  the  data  of  [11] ,  the  author  selected  a  reference  plane  such 
that  his  results  agree  well  with  those  of  [11]  for  2.8  GHz. 

Having  done  so,  the  frequency  is  varied  (keeping  the  reference 
plane  constant)  and  the  phase  of  the  reflection  coefficient  is 
calculated  for  several  frequencies  between  2 . 8  and  3 . 1  GHz .  Good 
agreement  exists  between  our  calculations  and  both  the 
theoretical  and  experimental  values  shown  in  [11] .  The  resonant 
frequency  of  the  array  is  located  near  3  GHz  with  a  fairly  large 
value  of  the  patch's  Q. 

Circular  polarization  gratings 

In  order  to  achieve  circular  polarization  from  the  arrays  of 
microstrip-fed  dipoles  shown  in  Figure  4,  the  polarizer  circuit 
of  Figure  5  is  considered.  Since  the  dipoles  produce  a 
predominantly  linearly-polarized  wave  in  the  y-direction,  it  is 
necessary  for  the  polarizer  circuit  to  couple  some  of  the  y- 
directed  radiation  into  the  x-directed  component,  and  provide  the 
90  degree  phase  differential  between  these  components.  One  may 
view  the  printed  circuit  of  Figure  5  as  a  step  implementation  of 
an  oblique  dipole  capable  of  being  excited  by  both  x  and  y- 
directed  electric  fields .  The  polarizer  substrate ' s  thickness 
and  permittivity  (tp=0.005m,  6p=6.7)  are  adjusted  so  that 
circular  polarization  is  obtained  near  broadside.  In  an  attempt 
to  set  the  phase  reference  at  the  dipole's  apex,  the  voltage 
generator  in  Figure  4  is  moved  from  the  back  wall  junction  (z=0) 
to  the  center  of  the  y-directed  segment  of  the  microstrip  line. 
Also,  to  calculate  the  axial  ratio,  we  express  the  field  radiated 
from  the  face  of  the  array  due  to  each  propagating  Floquet  mode 
as : 


(29) 


Then  the  axial  ratio  is  found  from  [12]  as 


AR=10Log'. 


lo: 


where 


w-y 


(30) 


I^L={-|{l^eP-^|£'J"±VI^9l'  +  l^$l'^2|£'e|‘^|£^l'‘cos[2{p^-Pe)]}} 


In  Figures  6  and  7,  the  E-plane  active  impedance  (Z=R+jX)  is 
calculated  for  both  configurations  of  Figure  4  at  frequencies 
f=285,  300  and  315  MHz.  Both  arrays  are  capable  of  radiating  a 
CP  wave  near  broadside  (see  axial  ratio  curves  in  Figure  8) . 
However,  the  performance  is  good  over  a  small  range,  generally 
less  than  30  degrees.  Note  the  presence  of  a  feed  anomaly, 
evidenced  in  the  impedance  curves,  near  the  location  where  the 
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(b)  Dipole  element  with  bent  arms 


4 .  The  schematic  of  the  dipole  elements  fed  by 

coplanar  stubs  coupled  to  an  excited  microstrip 
line  (dashed  lines) . 
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Y 


Figure  5.  The  sketch  of  the  printed  conductors  of  the  polarizer. 


Figure  6 .  The  element  impedance  of  the  array  of  dipoles  with 

straight  arms  and  covered  with  a  polarizer  versus 
E-plane  scan  (a=b=0.5Xo,  d=0.3A,o,  t=0.01Ao,  e^=2.2, 
f„=300  MHz)  . 
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Figure  7 . 


Figure  8 . 


The  element  impedance  of  the  array  of  dipoles  with 
bent  arms  and  covered  with  a  polarizer  versus  E- 
plane  scan  (a=b=0.5Ao,  d=0.3A,o,  t=0.01Xo,  6^=2. 2, 
f„=300  MHz)  . 


The  axial  ratio  values  of  the  arrays  of  dipoles 
with  straight  and  bent  arms  and  covered  with  a 
polarizer  versus  E-plane  scan  (a=b=0.5Ao,  d=0.3A.o, 
t=0.01Ao,  6^=2. 2,  f„=300  MHz). 
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axial  ratio  deteriorates.  Because  of  the  feed  radiation,  both 
configurations  have  (at  f=300  MHz)  an  E-plane  scan  range  of  about 
30  degrees  to  begin  with.  The  addition  of  a  CP-producing 
polarizer  layer  has  beneficial  effects  only  within  this  range. 
Figure  6,  7  and  8  show  also  the  impedance  and  axial  ratio  values 
for  a  ±5%  shift  in  frequency. 
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Abstract 

A  flexible  model  of  polycrystalline  metallization  on  integrated  circuits  was  developed,  and  the  construction  of  a 
computer  simulation  to  study  electromigration  in  such  structures  has  begun.  This  work  was  part  of  an  ongoing 
effort  to  determine  whether  microcircuit  failure  due  to  electromigration  can  be  reduced  or  eliminated.  Experiments 
have  revealed  that  void  formations  in  aluminum  interconnections  on  integrated  circuits  result  from  current-induced 
transfjort  of  metal  atoms  [1].  A  computer  model  is  being  developed  to  simulate  this  effect.  The  model  will  permit 
variation  of  boundary  conditions,  including  grain  size  and  orientation,  thermal  effects,  crystallite  composition,  as 
well  as  interatomic  potentials.  The  roles  of  grain  boundaries  and  interstitial  atoms  are  of  particular  interest  since 
these  are  parameters  that  can  be  controlled,  to  some  degree,  during  fabrication.  The  computer  code  is  not  complete; 
specific  results  are  not  reported.  However,  progress  was  made  in  the  development  of  the  model,  and  significant 
portions  of  the  code  have  been  written.  A  commitment  to  completion  endures  regardless  of  further  support. 
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A  MOLECULAR  DYNAMICS  SIMULATION  OF  ELECTROMIGRATION 


H.F.  Helbig 
T.Bartelt 

Introduction 

Microcircuit  failure  due  to  current-driven  changes  in  the  interconnections  (traces)  between  circuit  elements  is  a 
serious  obstacle  to  reliable  performance.  Such  failures  are  due  to  the  transport  of  metal  atoms  away  from  their 
original,  more  or  less  uniform,  distribution  along  the  trace;  a  process  called  electromigration.  Operating 
temperature,  thermal  gradients,  mechanical  stress,  current  density,  grain  size  and  structure,  and  defects  (particularly 
vacancies  and  impurities)  affect  failure  rate.  Experimental  studies  to  determine  the  effect  of  each  of  these 
parameters  are  challenging  because  of  the  difficulty  of  sample  preparation  and  the  destructive  nature  of  both  the 
experiment  and  measurement  techniques.  A  realistic,  three-dimensional,  polycrystalline  model  that  permits  control 
over  individual  parameters  can  be  pivotal  to  the  understanding  of  this  phenomenon. 

If  one  represents  the  interactions  between  atoms  by  empirical  potential  functions  (rather  than  doing  ab  initio, 
quantum  mechanical,  many  body  calculations),  it  is  currently  possible  to  handle  the  motions  of  thousands  of  atoms, 
in  reasonable  amounts  of  computing  time,  without  resort  to  super-computers.  Once  the  concepts  described  here  are 
proven  on  commonly  available  machines  (386/486  PCs  and  workstations),  it  will  undoubtedly  be  desirable  to  move 
the  code  to  one  of  the  national  super-computer  facilities  to  study  larger  structures.  The  simulation  code  is  being 
written  in  the  C  programming  language  so  that  it  will  be  easy  to  port  to  a  wide  range  of  computers. 

The  primary  interface  to  the  program  is  a  text  file  containing  information  about  initial  condition,  including  crystal 
structure  and  composition.  The  output  will  be  positions  and  velocities  of  each  of  the  atoms,  directed  to  files  at 
predetermined  intervals.  Additional  modules  will  be  used  to  analyze  the  output  files  to  determine  such  things  as 
diffusion  coefficients  and  failure  rates.  A  graphical  interface  would  also  be  useful,  and  is  being  considered.  Linking 
a  graphical  interface  directly  to  the  main  program,  however,  would  severely  restrict  portability. 

The  principal  objective  of  this  project  is  to  study  the  migration  of  metal  atoms  in  a  polycrystalline  conductor  as  they 
react  to  thermal  and  "electron  wind"  processes.  That  objective  is  to  be  accomplished  in  four  steps  proceeding, 
generally  in  the  following  sequence. 

1.  Completion  of  the  simulation  program,  MDEM 
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2.  Preparation  of  test  sample  configurations 

3.  Collection  of  simulation  data 

4.  Comparison  of  simulation  results  and  experimental  data 
The  sections  to  follow  describe  these  steps  in  detail. 

Step  1.  describes  the  simulation  program  by  subheadings  corresponding  to  the  names  of  the  major  C-code  modules 
from  which  MDEM  is  constructed.  The  functions  that  calculate  the  various  forces  that  act  on  the  atoms  represent 
choices  about  the  fundamental  physical  mechanisms  believed  to  control  atomic  migration  in  a  conductor.  For  that 
reason  those  functions  contain  more  extensive  discussion  than  those  that  attend  mainly  to  bookkeeping  chores. 

Step  2.  demands  Sf>ecial  comment  as  it  includes  an  effort  complementary  to  the  simulation  as  originally  conceived. 
In  particular,  we  propose  to  simulate  the  deposition  of  the  metallization  onto  its  substrate.  This  will  provide  an 
alternative  and,  perhaps,  more  realistic  starting  configuration  for  the  simulated  conductor  than  so  far  provided  by 
MDEM. 

Step  3.  describes  the  sorts  of  data  that  we  plan  initially  to  extract  from  the  simulation. 

Step  4.  is  the  pivotal,  feedback  step.  There  is  already  a  wealth  of  experimental  data  that  document  the  onset  and 
evolution  of  electromigration  damage,  e.g.,  void  formation  and  hillock  growth  [1].  Although  such  data  are 
microscopic  in  scale,  the  smallest  features  are  still  gross  on  the  scale  of  atoms.  The  chief  utility  of  the  simulation 
will  be  to  show  the  connection  between  processes  occurring  at  the  atomic  level  and  the  experimental  data.  We  hope 
that  the  ability  of  the  simulation  to  make  this  connection  will  suggest  new  lines  of  experimental  effort  at  Rome  Lab, 
and  ultimately  provide  both  fundamental  and  practical  information  that  can  be  used  to  reduce  circuit  failures  due 
to  electromigration. 


Technical  Description 


1.  Completion  of  MDEM 

The  main  functional  modules  of  MDEM  are  listed  below  along  with  a  description  of  the  services  they  provide  and 
an  indication  of  their  current  status. 
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input  conTiguration  (outlined;  under  development) 


This  reads  and  parses  the  configuration  input  file  provided  by  the  user.  The  configuration  file  contains  all  the 
information  about  the  atoms  required  for  the  simulation.  Apart  from  data  relevant  to  the  entire  sample  (e.g., 
tempierature),  this  information  is  organized  as  a  set  of  crystallite  descriptions.  The  data  format  is  very  general, 
utilizing  a  system  of  "tags"  so  that  the  configuration  file  can  contain  documentation  that  is  ignored  by  the  parser, 
and  so  that  it  is  easy  to  allow  for  additional  input  parameters,  should  they  become  necessary,  without  invalidating 
earlier  configuration  files. 

Parameters  currently  anticipated  for  the  configuration  file  are  the  system  temperature,  and  for  each  crystallite,  atom 
species,  impurity  species  and  concentration,  vacancy  concentration,  the  shape,  size,  orientation  and  position  of  the 
crystallite,  the  character  of  each  of  its  boundary  planes  (are  the  atoms  near  that  plane  free  or  pinned,  and/or  do  they 
act  as  thermostats?),  the  lattice  vectors  defining  the  shape,  size,  orientation  and  position  of  the  primitive  unit  cell, 
and  the  constants  that  define  the  interaction  energy  between  pairs  of  atoms  and  between  the  atoms  and  the  electron 
gas. 

Choosing  the  parameters  for  the  configuration  file  is  the  present  solution  to  the  "sample  preparation"  problem.  This 
and  an  alternative  solution,  simulation  of  the  film  deposition  process  itself,  are  discussed  in  section  2,  below. 


make  crystailite  (code  written) 

This  uses  the  information  gleaned  by  input  configuration  to  initialize  the  atomic  data  array,  crystallite  by  crystallite. 
The  atomic  data  array  keeps  track  of  the  positions  and  velocities  of  the  entire  set  of  atoms  in  the  sample.  This 
routine  must  do  a  substantial  amount  of  bookkeeping  and  geometric  analysis  to  assure  that  the  user’s  input  data  is 
self-consistent.  Its  construction  consumed  much  of  the  two  month  Summer  Research  Program. 


modify  crystallite  (code  outlined) 

This  is  actually  a  sub-module  of  make_crystallite  that  reprocesses  the  atomic  data  array  to  create  vacancies, 
impurities  or  to  perform  other  operations  that  might  be  desired  as  part  of  the  sample  initialization. 
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monitor 


(code  to  be  written) 


This  is  the  module  that  calls  for  integration  of  the  equations  of  motion  and  monitors  the  elapsed  time  or  other  cues 
that  have  been  requested  by  the  user  to  trigger  the  collection  of  data. 


int^rate  (code  written) 

This  performs  a  numerical  integration  of  Newton’s  equations  of  motion  (F,  =  mja,)  for  each  of  the  atoms  in  the 
sample  using  forces  that  are  described  under  the  next  code  module.  An  efficient  algorithm  (4th  order  Runge-Kutta 
with  adaptive  step-size)  has  been  adapted  for  use  in  MDEM  [2].  The  integration  routine  takes  the  system  from  its 
current  state  (the  positions  and  velocities  of  all  the  atoms  at  time  t)  to  its  state  at  time  t  +  At  under  the  forces 
assumed  to  control  the  atomic  motion. 

Adaptive  step-size  controls  the  size  of  At  so  that  the  error  of  the  numerical  integration  is  maintained  within 
acceptable  bounds;  small  enough  that  the  output  is  free  of  computational  artifacts  and  large  enough  that  the  evolution 
of  the  system  proceeds  at  a  reasonable  rate.  It  is  expected  that  the  natural  time  scale  for  the  system  motion  will 
be  on  the  order  of  the  period  of  lattice  vibrations,  i.e.,  about  100  fs,  so  that  At  will  necessarily  be  somewhat  smaller 
than  this. 

Rather  than  include  the  force  model  in  the  integration  routine,  it  is  embodied  in  a  separate  routine  so  that  new  force 
models  can  be  substituted  easily. 

emt  force  (code  outlined) 

This  provides  the  force  exerted  on  each  atom  according  to  the  recently  developed  "effective  medium  theory.  Forces 
associated  with  thermal  control  of  the  sample  and  with  the  electron  wind  force  presumably  responsible  for 
electromigration  will  be  supplied  by  routines  similar  to  this,  but  are  discussed  under  this  module  for  compactness 
of  presentation. 

In  order  that  a  molecular  dynamics  simulation  of  a  metal  properly  reproduce  the  elastic  and  thermal  properties  (e.g.. 
Young’s  modulus,  thermal  expansion  coefficient)  characteristic  of  real  metallic  systems,  it  has  been  found  necessary 
by  several  researchers  [3,4,5,6,7]  to  describe  the  system  energy  by  functions  more  sophisticated  than  pair  potentials. 
This  is  because  a  major  portion  of  the  cohesive  energy  in  metal  systems  arises  from  the  interaction  of  the  metal  ions 
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with  the  "gas"  of  nearly  free  conduction  electrons  rather  than  from  the  direct  interaction  of  the  ions  with  their 
neighbors.  On  this  account,  we  have  begun  to  adapt  the  effective  medium  theory  of  Jacobsen  et  al.[3],  as  extended 
by  Hakkinen  and  Manninen  [6]  to  MDEM.  Fortunately,  this  recent  literature  provides  a  background  of  work  on 
aluminum  and  copper  that  will  be  directly  applicable  to  the  electromigration  problem. 

Preliminary  tests  of  the  code  will  be  made  using  simple  pair  potentials  of  the  Lennard-Jones  or  Morse  type. 

It  is  important  to  provide  the  simulation  with  a  means  to  add  or  remove  energy  from  the  system.  In  nature,  this 
is  accomplished  by  exchanges  of  energy  between  the  system  and  its  environment,  but  simulations  caimot  afford  the 
luxury  of  nearly  infinite  heat  reservoirs.  Hoover  [8]  has  shown  a  simple  technique  for  temperature  control  that 
properly  preserves  the  fluctuations  characteristic  of  the  canonical  ensemble  distributions  of  statistical  mechanics. 
This  technique  was  employed  by  Valkealahti  and  Manninen  [7]  in  their  work  on  the  melting  of  copper  clusters. 

In  short,  the  idea  is  to  endow  some  of  the  atoms  in  the  simulation  with  an  additional,  velocity-dependent  force  that 
behaves  like  a  friction  force  except  that  the  friction  coefficient  may  be  either  positive  or  negative.  By  linking  the 
friction  coefficient  to  the  average  system  kinetic  energy  (temperature),  energy  may  introduced  or  extracted  from 
the  system  as  required  to  stabilize  the  system  temperature.  The  obvious  location  for  "thermostat"  atoms  is  at  the 
face  of  the  crystallite  intended  to  simulate  the  substrate  since  that  face  would  in  nature  serve  as  the  main  heat  sink 
for  the  conductor. 

For  numerical  experiments  involving  thermal  effects  alone,  this  thermostat  mechanism  can  be  used  to  maintain 
thermal  equilibrium.  For  experiments  also  involving  the  PR  heating  associated  with  an  electron  wind,  the 
thermostats  will  allow  the  maintenance  of  steady-state  conditions  including  thermal  gradients.  Thus,  the  simulation 
may  be  able  to  complement  the  work  by  Rome  Lab’s  Dr.  Mark  Levi  on  the  importance  of  thermal  gradients  to  the 
electromigration  process. 

Finally,  the  passage  of  an  electric  current  through  the  sample  will  be  simulated  by  adding  random  momentum 
increments  (biased  appropriately  in  the  direction  of  the  electron  drift)  to  the  metal  atoms  in  the  sample.  It  is 
anticipated  that  these  momentum  increments,  when  applied  to  atoms  immediately  upstream  from  vacancies,  will 
prove  to  be  the  agent  responsible  for  the  preferential  migration  of  metal  atoms  along  the  direction  of  the  electron 
flow  and  of  vacancies  in  the  opposite  direction. 

It  is  difficult  in  laboratory  experiments  to  discover  whether  this  electromigration  scenario  is  accurate.  The  reason 
is  that  in  normal  circuit  operation  the  process  is  slow.  In  terms  of  the  atomic  picture,  the  significant  collisions  are 
rare,  and  they  are  accompanied  by  vast  numbers  of  electron-phonon  collisions  that  are  responsible  for  Joule  heating 


8-7 


but  are  relatively  ineffectual  as  a  cause  of  metal  migration.  Experimental  attempts  to  accelerate  the  migration- 
stimulating  collisions  are  accompanied  by  heating  effects  that  alter  the  conditions  under  which  electromigration 
occurs  in  practice. 

The  simulation  offers  the  possibility  of  separating  the  effects  of  collisions  between  electrons  and  vacancy-proximate 
atoms  from  those  that  merely  add  heat  to  the  lattice.  It  seems  reasonable,  at  least  during  the  nucleation  stages  of 
void  formation,  that  these  two  mechanisms  are  independent  and  their  effects  additive.  If  this  is  so,  it  may  be 
possible  to  accelerate  this  atom-plus-vacancy  migration  mechanism  without  materially  increasing  the  Joule  heating. 


This  could  be  done,  for  example,  by  applying  a  heavy  dose  of  electron-wind  to  randomly  selected  atoms  during  each 
time  step,  moving  the  atoms  according  to  Newton’s  laws,  and  then  scaling  the  speed  of  each  atom  down  to  prevent 
the  average  kinetic  energy  (temperature)  from  rising.  If  atoms  next  to  vacancies  are,  indeed,  the  primary  transport 
objects,  this  method  would  reveal  that  fact  without  increasing  the  system  temperature.  The  result  will  be  to  compress 
the  time  scale  on  which  effects  of  this  electron-wind  model  emerge  while  controlling  the  sample  temperature 
independently. 

Whether  this  artificial  method  of  temperature  control  will  produce  the  same  migration  effects  as  result  from 
prolonged  current  flow  at  lower  current  densities  is  a  question  that  can  be  explored  with  the  simulation. 


randn  &  gausdevn  (code  completed) 

These  are  (ps'eMdo)random  number  generators,  called  upon  by  many  routines.  Although  generic  linear  congruential 
random  number  generators,  supplied  with  most  compilers,  might  be  adequate  for  some  of  the  simulation’s 
requirements  (e.g.,  assigning  random  initial  velocities  and  displacements  to  atoms  in  the  system),  they  are  always 
suspicious,  and  undoubtedly  insufficient  for  electron  wind-lattice  interactions.  The  algorithms  chosen  here  are 
adaptations  of  the  work  of  Press  et  al  [2].  Randn  produces  {pseudo)miAota  numbers  with  uniform  distribution, 
gausdevn  yields  a  Gaussian  distribution. 


Additional  routines  that  are  not  linked  with  MDEM  include; 


comb  data  (code  to  be  written) 
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This  combs  through  the  atomic  data  array  collecting  information  for  such  calculations  as  mean  square  displacements, 
radial  distribution  functions,  and  density  profiles. 

view3d  (under  development) 

This  presents  a  perspective  view  of  the  sample  atoms  so  that  the  user  may  form  visual  impressions  of  the  sample. 
Naturally,  this  is  where  portability  breaks  down  since  graphics  standards  vary  with  platforms.  This  important  task, 
however,  is  not  as  computationally  intensive  as  the  simulation  itself,  so  it  will  not  be  necessary  to  adapt  it  to  every 
machine.  A  program  that  allows  one  to  visually  scan  through  the  sample  is  being  developed  for  some  of  the  most 
popular  operating  systems. 

2.  Preparation  of  test  sample  configurations 

The  first  step  in  simulating  a  conducting  line  is  to  create  the  line.  In  the  manufacture  of  microcircuits  the  line  is 
prepared,  first  by  depositing  a  film  of  metal  atoms  onto  a  substrate,  and  then  by  etching  away  unwanted  regions 
of  the  film.  During  the  deposition,  islands  of  metal  grow  around  nucleation  sites  randomly  located  on  the  substrate. 
Each  island  may  be  thought  of  as  a  randomly  oriented  and  imperfect  crystallite  or  grain.  The  islands  grow  until 
adjacent  grains  meet,  and  the  resulting  interfaces  become  grain  boundaries  of  a  polycrystalline  film.  What  remains 
after  the  etching  stage  is  a  polycrystalline  conductor  whose  transverse  dimensions  are  comparable  in  size  with  the 
grains. 

Preparation  of  a  simulated  polycrystalline  conductor  may  be  accomplished  in  at  least  two  ways.  The  starting 
configuration  may  be  tailored  to  test  various  idealized  arrangements  of  grain  boundaries,  vacancy  and  impurity 
concentrations.  Alternatively,  it  is  possible  to  simulate  the  island  growth  process  itself. 

Completed  components  of  MDEM  provide  for  the  first  alternative.  The  user  of  the  program  can  assemble  a 
polycrystalline  sample  by  specifying  the  shape,  position  and  orientation  of  each  grain.  Furthermore,  each  grain  may 
be  specified  to  contain  atoms  of  a  given  species  and  be  interspersed  with  vacancies  and  impurity  atoms  of  other 
species.  While  most  grains  will  contain  "metal"  atoms,  some  may  be  populated  with  atoms  bound  more  tightly  than 
those  simulating  metals,  and  be  deployed  as  substrate  and  passivation  structures. 

Simulation  of  the  deposition  process  may  be  of  considerable  value  if  it  can  provide  more  realistic  starting 
configurations.  We  hope  to  be  able  to  include  this  as  an  additional  facility. 
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3. 


Collection  of  simulation  data 


The  simulation  will  necessarily  maintain  a  record  of  the  current  state  of  the  system  in  microscopic  detail;  that  is, 
the  positions  and  velocities  of  all  the  atoms  and  of  the  time  elapsed.  In  order  to  interpret  these  data  as  they  evolve 
in  time,  it  will  be  necessary  to  obtain  periodic  summaries  of  various  macroscopic  statistics.  Preliminary  parameters 
that  will  be  calculated  from  the  position/velocity  data  include  the  elastic  moduli  and  thermal  expansion  coefficients 
of  the  modeled  metal.  Quantities  more  closely  related  to  the  electromigration  process  include  system  energy,  density 
and  temperature  gradients  and  fluctuations,  and  diffusion  coefficients. 


4.  Comparison  of  simulated  and  experimental  data 

The  data  collected  from  the  simulation  may  be  displayed  as  tables,  graphs  and  visualizations.  After  ascertaining 
that  the  simulation  properly  reproduces  the  known  mechanical  and  thermal  properties  of  the  metal,  it  will  be  of 
major  interest  to  determine  the  effects  of  the  orientation  of  grain  boundaries,  relative  to  the  direction  of  current 
flow,  on  the  transport  of  atoms  and  on  the  concentration  of  vacancies.  Arc  these  effects  consistent  with  observed 
electromigration  phenomena?  We  anticipate  that  graphics  to  present  three  dimensional  visualizations  will  be  crucial 
to  these  comparisons. 


Final  comment 

The  generality  of  this  simulation  scheme  is  noteworthy.  In  addition  to  the  experiments  already  indicated,  other 
applications  are  easily  imagined.  Studies  of  thermal  and  mechanical  stress  effects  could  be  made  in  conjunction  with 
electromigration  or  separately.  Looking  even  farther  afield,  nothing  inherently  restricts  the  modeled  system  to  the 
solid  state.  Studies  of  interfaces  between  (or  among)  any  of  the  phases  could  easily  be  accommodated. 

A  project  of  this  magnitude  could  not  have  been  completed  in  the  twelve  weeks  of  the  Summer  Research  Program. 
Consequently,  specific  results  are  not  reportable.  We  have,  instead,  presented  an  outline  of  the  project,  and 
indicated  the  progress  made  toward  completion.  These  accomplishments  have  led  to  a  the  negotiation  of  a  contract 
to  complete  the  simulation  at  Rome  Lab  between  September,  1993  and  July  1994.  The  results  of  preliminary 
studies  are  to  be  reported  at  the  1993  Fall  Meeting  of  the  Materials  Research  Society  [9]. 
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Estimating  Tlie  Number  Of  Defects 
Under  Imperfect  Enviromiicntal  Stress  Screen  Levels 

Lifang  Hsu 
Associate  Professor 
Department  of  Mathematics 
Le  Moyne  College 

Abstract 

A  fixed  sample  size  procedure  is  proposed  to  quantify  the  initial  and  refined  estimates  of  the  detect  of 
electronics  density  for  Environmental  Stress  Screening  (ESS)  plans.  The  idea  of  group  testing,  i.e., 
testing  units  in  batches  instead  of  individually,  where  each  test  indicates  whether  the  test  batch  contains 
only  good  parts  or  whether  it  contains  at  least  one  defective,  is  applied  to  the  problem  of  estimating  the 
probability,  ic,  of  an  arbitrary  unit  being  defective.  Five  estimators  are  given  with  their  root  mean  square 
error,  or  asymptotic  variance.  An  example  and  table  of  procedure  parameters  are  given  to  illustrate  the 
proposed  method. 
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Estimating  The  Number  Of  Defects 
Under  Luperfect  Environmental  Stress  Screen  Levels 

Lifang  Hsu 

I.  Introduction 

This  report  proposes  a  statistical  methodology  to  quantify  the  initial  and  refined  estimates  of  the 
number  of  defects  resident  in  electronic  equipment  subject  to  Enviromnental  Stress  Screening  (ESS).  ESS 
plans,  which  are  applied  during  the  development  and  production  phases,  can  yield  significant 
improvements  in  field  reliability  and  reductions  in  field  maintenance  costs.  One  parameter  important  in 
the  economical  design  of  ESS  plans  is  the  degree  of  contamination;  i.e.,  the  proportion  of  parts  with  latent 
defects  in  an  otherwise  well  designed  equipment.  This  proportion  governs  the  ESS  process  and 
determines  the  rate  at  which  the  purification  proceeds.  Tlie  ESS  methodology,  as  described  in  MIL- 
HDBK-344  (1993)  estimates  the  number  of  incoming  defects  based  on  the  equipment  complexity  and  the 
reliability  levels  of  the  various  parts  and  processes.  This  is  accomplished  by  describing  the  equipment  in 
terms  of  the  quantity  of  items  within  various  part-process  categories  and  multiplying  these  by  an  expected 
fraction  defective  for  each  of  the  part-reliability  grades  and  process  types.  The  fraction  defectives  are 
determined  using  lookup  tables  found  in  MIL-HDBK-217  (1991)  and  have  different  values  depending 
upon  the  end  application  operating  enviromnent.  Because  MIL-HDBK-217  (1991)  defect  density  data  is 
generic,  it  may  not  be  sufficiently  current  for  certain  programs.  In  this  report,  we  propose  a  fixed  sample 
size  procedure  to  provide  the  estimates  of:  (1)  part  defect  density,  (2)  equipment  defect  density,  and  the 
(3)  proportion  of  part  and  manufactming  defects.  Since  the  number  of  defects  in  equipment  depends  on 
the  quantity  and  quality  of  parts  and  workmanship  characteristics,  we  assume  that  there  are  k  independent 
and  identically  distributed  parts  in  the  equipment  (the  procedure  could  also  be  used  and  generalized  for 
equipment  with  multiple  part  types).  The  quality  of  parts  is  characterized  by  the  probability,  k,  of  a  part 
being  defective.  The  defective  probability  ic  is  an  unknown  parameter  and  its  value  depends  upon  the  end 
application  operating  enviromnent.  The  proposed  experiment  is  to  apply  base  line  screening  tests  with 
known  sensitivity  q  and  specificity  6  to  a  given  m  equipments.  The  value  q  is  the  probability  that  an 
equipment  is  defective  given  that  at  least  one  part  is  defective.  It  is  determined  from  the  product  of 
precipitation  efficiency  and  detection  efficiency.  The  precipitation  efficiency  for  a  given  stress  type,  level, 
and  duration  is  determined  by  using  lookup  tables  or  solving  equations  that  are  provided  in  MIl^HDBK- 
344  (1993).  The  test  detection  efficiency  is  determined  based  on  the  type  of  testing  perfonned  and  whether 
testing  is  performed  during  exposure  to  stress.  The  value  6  is  the  probability  that  the  equipment  is  not 
defective  given  that  all  the  parts  are  good.  The  value  of  6  is  determined  [Barlow,  Bazorsky,  and  Wechsler 
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(1991)]  by  the  equation  (T/(T+  t))K  where  T  is  the  expected  life  lime  of  a  good  part  and  t  is  the  screen 
duration.  Based  on  an  analysis  of  screening  fallout  data  and  subsequent  failure  analysis,  an  improved 
environmental  stress  screening  level  with  a  new  sensitivity  q'  and  a  new  specificity  0'  will  be  generated. 
Note  that  the  base  line  stress  levels  should  always  be  higher  than  the  application  stress.  To  account  for 
non-relevent  type  failures,  i.e.,  false  alarms,  or  non-operational  failures,  the  defective  equipments  from  the 
base  line  screening  will  then  be  subjected  to  the  improved  stress  sereening  test.  Sinee  the  majority  of  parts 
and  workmanship  operations  are  defect  free,  only  some  small  fraction  will  contain  flaws  or  defects  that 
can  be  measured  as  a  defect  density  in  defects  per  million.  The  sum  of  the  defects  from  parts  and 
workmanship  operations  determines  the  total  defects  per  equipment.  The  model  and  procedure  developed 
here  is  similar  to  that  arising  in  group  testing.  The  ideas  of  group  testing,  i.e.,  testing  parts  in  equipments 
instead  of  individually,  where  each  test  indicates  whether  the  tested  equipment  contains  only  good  parts 
or  whether  it  contains  at  least  one  defect,  is  applied  to  the  problem  of  estimating  the  probability  tc  of  an 
arbitrary  part  being  defective.  The  ideas  of  group  testing  were  first  used  by  [Dorfman  (1943)].  However, 
the  original  purpose  of  Dorfman's  procedure  was  to  minimizes  the  number  of  tests  needed  to  identify  all 
carriers  of  a  disease  in  a  large  population.  Another  aspect  of  the  group  testing  problem  was  in  the 
estimation  of  the  frequency  of  a  disease  or  of  some  property,  which  was  studied  by  [Sobel  and  Elashoff 
(1975)].  In  that  paper,  they  assume  that  the  classification  process  is  perfect.  For  this  efi'ort  we  assume  that 
the  classification  process  is  imperfect. 

This  report  is  organized  in  the  following  manner:  Section  2  describes  the  problem  and  mathematical 
model  in  detail.  Section  3  investigates  the  estimation  procedures  and  properties  of  the  estimator.  Section  4 
gives  an  example  and  tables  from  procedure  parameters  to  illustrate  the  proposed  method. 


II.  Description  of  the  problem  and  the  mathematical  method 

When  estimating  the  number  of  defects  resident  in  an  equipment,  a  preliminary  estimate  can  be  based 
on  the  results  of  screening  tests.  A  random  sample  of  m  equipments  taken  from  the  production  line  are 
subject  to  a  baseline  stress  screening  test.The  initial  estimate  of  percentage  defect  can  be  derived  by  using 
the  results  of  the  base  line  stress  screening  test.  Based  on  an  analysis  of  the  screening  fallout  data  and 
failure  analysis,  an  improved  stress  screening  test  will  be  developed.  Those  equipments  classified  as 
defective  are  given  an  improved  stress  screening  test.  The  estimate  can  then  be  refined  based  on  the 
results  of  both  the  base  line  stress  screening  and  the  improved  stress  screening  tests. 
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Before  proceeding  with  the  description  of  the  model,  the  parameters  that  determine  the  accuracy  and 
the  predictive  value  of  the  tests  arc  defined: 

k  -  the  number  of  parts  in  an  equipment; 

m  -  the  number  of  equipments  to  be  tested; 

7t  -  the  probability  of  a  part  being  defective. 

Bj  =  1  if  ith  equipment  is  defective  after  base  line  stress  screening  test, 

0  otherwise; 

Ij  =  1  if  ith  equipment  is  defective  after  improved  stress  screening  test, 

0  otherwise; 

Cj  =  1  if  ith  equipment  contains  at  least  one  latent  defect, 

0  otherwise; 

q  =  P(Bj  =  1  I  Cj  =  1)  -  the  probability  that  the  latent  defect  will  not  escape  from  the 
screen,  i.e. ,  the  sensitivity  of  the  base  line  stress  screening  test; 

0  =  P(Bj  =  0  I  Cj  =  0)  -  the  probability  that  a  good  part  will  survive  the  screen,  i.e. 
the  specificity  of  the  base  line  stress  screening  test; 

n'  =  P(Ij  =  1  I  Cj  =  1)  -  the  sensitivity  of  the  improved  stress  screening  test; 

0'  =  P(Ij  =  0  I  Cj  =  0)  -  the  specificity  of  the  improved  stress  screening  test; 

We  assume  the  base  line  stress  screening  test  and  improved  stress  screening  test  are  independent.  Their 
sensitivity  and  specificity  do  not  depend  on  the  number  of  parts  in  the  equipment.  We  base  our  inferences 
of  the  initial  estimate  on  the  statistic: 
m 

B  =  E  Bj ,  the  number  of  defective  equipments  on  base  line  stress  screening  test. 
i=l 

The  random  variable  B,  has  a  binomial  distribution: 
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(1) 


f(b)  =  f  m '1  p*’(l-p)™'^,  b  =  0,l,...,m,  0^p<  1. 

lb  J 

where  p  =  P(  Bj  =  1)  =  P(  Bj  =  1,  Cj  =  1)  +  P(  Bj  =  1,  Cj  =  0) 

=  P(Ci  =  l)P(Bi  =  1|  Cj  =  1)  +  P(Ci  =  0)P(Bi  =  1|  Cj  =  0)  (2) 

=  (1  -(1-K)*^)ri  +(1  -7c)*'(l-0). 

We  base  our  inferences  of  the  refined  estimate  on  the  following  statistics: 
m 

X  =  £  Bj  Ij ,  the  number  of  defective  equipment  on  both  screening  tests; 
i=l 
m 

y  =  £  Bi(  1  -  Ij) ,  the  number  of  defective  equipment  on  base  line  stress  screening  test  only; 
i=l 
m 

z  =  £  (1  -  Bj) ,  the  number  of  defective  free  equipment  on  base  line  stress  screening  test. 
i=l 

The  random  variables  x,y,z  have  a  trinomial  distribution: 

3 

f(x,y,z)  =  f  m  1  Pi’‘P2yp3^>  £  pj  »  I,  x  +  y  +  z  =  m  (3) 

[xyz  J  i=l 

where  pj  =  P(Bj  =  1,  Ij  =  1)  =  P(Bj=l,  Ij=l,Cj=l)  +  P(Bj=l,  Ij=l,  Cj=0) 

=  PfCj  =  l)P(Bi=l|Ci=l)P(Ii=l|Ci=l)  +  P(Ci=0)P(Bi=l|Cj=0)P(Ii=l|Ci=0) 

=  [l-(l-7:)k]nq'  +  (l-x)k(l-0)(l-0'), 

P2  =  P(Bi  =  1,  Ij  =  0)  =  P(Bi=l,  li=0,  Cj=l)  +  P(Bi=l,  li=0,  Ci=0)  (4) 

=  PfCj  =  l)P(Bi=l|Ci=l)P(Ii=0|Ci=l)  +  P(Ci=0)P(Bi=l|Ci=0)P(Ii=0|Ci=0) 

=  [l-(l-Jt)>']il(l-n')  +  (l->c)’'(l-0)0', 

P3  =  P(Bi  =  0)  =  P(Bi=0,  Ci=  1)  +  P(Bi=0,  Ci=0) 

=  PfCj  =  l)P(Bi=0|Ci=l)  +  P(Ci=0)P(Bi=0|Ci=0) 

=  [l-(l-n)l'](l-n)  +  (l-Jt)l«0. 

The  proportion  of  equipment  being  defective  on  the  base  line  stress  screening  test  is  defined  as 

P(Bi  =  1  )  =  (1  -  (l-7r)k)n  +  (1  -  7t)J«(l-0).  (5) 

The  proportion  of  part  defects  given  that  the  equipment  is  defective  on  the  base  line  stress  screening 

test  is  defined  as 

P(Ci  =  1|  Bj  =  1)  =  P(Ci  =  1,  Bi=l)/P(Bi  =  1) 

=  (1  -  (l-7t)J')n  /[  (1  -  +  (1  -  ir)*Hl-0)l-  (6) 

The  proportion  of  manufacturing  defects  given  that  the  equipment  is  defective  on  the  base  line  stress 
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screening  test  is  defined  as 

P(Ci  =  0|  Bi  =  1)  =  P(Ci  =  0,  Bi=l)/P(Bi  =1) 

=  (I-TC)*'  0  /[  (1  -  (l-it)*^)ii  +  (1  -  7c)^l-0)].  (7) 

The  proportion  of  equipment  being  defective  on  the  unproved  stress  screening  test  is  defined  as 

P(Ii  =  1  )  =  (1  -  (l-7c)l^)q' +  (1  -  n:)^l-0').  (8) 


The  proportion  of  part  defects  given  that  the  equipment  is  defective  on  the  improved  stress  screening 
test  is  defined  as 

P(Ci  =  l|  Ij  =  1)  =  P(Ci  =  1,  Ij=l)/P(Ii  =  1) 

=  (1  -  (l-it)*')r)'  /[  (1  -  (l-7:)l^)n'+  (f  -  7c)l^(i-0')].  (9) 

The  proportion  of  manufacturing  defect  given  that  the  equipment  is  defective  on  the  improved  stress 
screening  test  is  defined  as 
PfCj  =  0|  Ij  =  f)  =  P(Ci  =  0,  Ii=i)/P(Ii  =1) 

=  (l-Tc)*'  (l-0')/[  (f  -  (l-7c)*^)ri'  +  (1  -  7i:)*^(l-0')].  (10) 

Our  inferences  are  based  on  the  sensitivities  (q,q')  and  specificities  (0,0')  of  the  base  line  stress 
screening  and  improved  stress  screening  tests  when  applied  to  m  equipments  of  k  identical  parts.  For  the 
equipment  with  multiple  part  types,  the  same  procedure  can  be  applied  to  different  part  types  separately. 


III.  Estimation  Procedures  and  Properties  of  the  Estimator 
Initial  Estimation 

While  the  probability  of  the  m  observed  test  results  is  a  function  of  a  number  of  parameters  (q,0,m,k, 
and  It),  in  the  procedure  developed  here  we  assume  all  parameters  except  it  are  known.  The  probability 
distribution  of  the  observation  is  the  likelihood  function  L(b|it)  which  equals  the  binomial  distribution 
stated  in  (1).  The  maximum  likelihood  estimator  (MLE)  of  it  is  easily  determined  since  L(b|it)  is  twice 
differentiable. 

An  initial  estimator  of  it,  itjj,  and  its  asymptotic  variance  are  obtained  by  solving  the  following 
equations; 

dl98L(t»Nl  I  =[(b/p)(dp/dit)]-([m-b]/[l-p])(dp/dit)  =  0  (11) 

dit  I  itjj 

aVar(itjj)  =  -  {  E  [  d^logLlblitl  ] 
dit^ 
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=  {E[(b/p2)(dp/d7t)2  -  (b/p)(d2p/d7c2)  +  ((m-b)/(l-p)^)(dp/d7t)2 

+  ((m-b)/(l-p))(d2p/dJc^)ir*  (12) 

where  p  is  given  in  (2)  and  aVar  denotes  the  variance  of  the  asymptotic  distribution  of  tin 
[Freeman  ( 1963)].  The  solutions  to  equations  (11)  and  ( 12)  are 

TCji  =  1  -  ([q  -  (b/m)]/(n+e-l))*/*'  (13) 

aVar(7tii)  =  {[q  -  (q+e-l)(l-7c)l«][l-q  +  (q+0-1)  (l-Jc)l^]|/|mk2(q+0-l)2(l-7t)2k-2|  (14) 

For  the  binomial  distribution  it  is  known  that  the  MLE  is  admissible  with  respect  to  the  given  loss 
[Johnson  (1971)].  Hence  ten  is  admissible  with  respect  to  the  square  error  loss.  In  Table  I  values  of  initial 
estimate  njj  are  listed  for  m  =  100,  q  =  .99, 0  =  .90,  and  k  =  1,2,4,8,16,32,64,72,  and  100.  We  note  that 
the  values  for  b  (the  observed  number  of  defective  equipment)  given  in  Table  I  are  greater  than  10  and  less 
than  99.  Since  icjj  in  (13)  exists  only  when  m-m0  <  b  <  mq. 

Another  estimator  I’®  obtained  by  taking  the  first  term  of  the  MacLaurin  series  of  ttjj, 

expanded  about  b  =  0. 

rtii  =  1  -  ([q  -  (b/m)]/(q+0-l))l/l'  =  1  -  (q/(q+6-l))l/l«(t  -  b/(kmq)  -(k-l)b2/2(kmq)2  -..) 

Jtl2  =  1  -  (q/(q+0-l))l/^l  -  b/(kmq))  (15) 

When  k  =  1,  =  7C[2-  The  expected  value,  bias,  and  variance  of  ttj2  are 

E(7Ci2)  =  1  -(q/(  q+0-l))l/l'[((k-l)q  +  (q+0-1)  (l-trlJ^l/lkq)] 

Bias(7Ci2)  =  1  -  -  (q/(  q+0-l)l%(k-l)q  +  (q+0-1)  (l-n)*^)/(kq)] 

Var(7Ci2)  =  {(q/(  q+0-l))2/*‘[q  -  (q+0-1)  (l-ic)*^]!!  -fi  +  (q+0-1)  d-tt)’'])  /(mk2q2) 

In  Table  II  values  of  the  root  mean  square  error  (RMSE)  of  ej2  and  (aVar(7Cjj))(/2  are  listed  for  m  = 
100,  (q  =  .99  ,0  ■=  .90),(q  =  .95  ,0  =  .85),  and  (q  =  .99  ,0  =  .99).  We  note  that  the  RMSE  associated  with 
rtj2  is  smaller  than  the  standard  error  for  the  MLE  ttjj  for  small  values  of  tc. 

The  proportions  Pbii(EbI2)  equipment  being  defective  on  the  base  tine  stress  screening  test  can  be 
estimated  by  replacing  Jtjj(7:j2  )  for  k  m  (5).  The  solutions  are: 

PbII  =  (1  -  (1  -  +  (  1  -  »tn)*Hl-0)  =  b/m  (16) 
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PbI2  “  (1  -  (1  -  'ti2)^)ii  +  (  1  -  rti2)^(l-e)  =  till  -  (1  -  (b/kiTin)*^] 


(17) 


Similarly,  the  estimates  of  proportions  of  parts  and  manufacturing  defects  given  that  the  equipment  is 
defective  on  the  base  line  stress  screening  test  can  be  obtained  by  the  following  equations: 

PsiiCpart  defect)  =  q(m0  -  m  +  b)/[b(q  +  6-1)]  (18) 

PbjI (manufacturing  defect)  =  [m(mq  -  b)(l  -  6)]/[b(q  +  0-1)]  (19) 

defect)  =  [(kmn)^(q  +  0  -  1)  -  (kmq  -  b)*^q]/[(q  +  6  -  l)[(kmq)*^  -  (kmq  -  b)^]]  (20) 

PBi2(manufacniring  defect)  =  [(kmq  -  b)*^(l  -  0)]/[(q  +  6  -  l)[(kmq)^  -  (kmq  -  b)^]]  (21) 


Refined  Estimation 

The  joint  probability  of  the  observed  test  results  is  a  function  of  (q,6,q',0',m,k,  and  n),  we  assume  all 
parameters  except  tc  are  known.  The  joint  probability  distribution  of  the  observations  is  the  likelihood 
function  L(x,y,z|7c)  which  equals  the  trinomial  distribution  stated  in  (3). 

The  maximum  likelihood  estimator  and  its  asymptotic  variance  can  be  obtained  by  solving  the 
following  equations: 

dlggLC2LY.^[gl  I  =  [(x/pi)(dpi/d7t)]  +  [(y/P2)(dp2/dJt)]  +  I(z/p3)(dp3/d7t)]  =  0  (22) 

djc  I  Jtjj 

Var(7CR)  =  -  {  E  [  d2logL(x.v.7.l7tl  ]  J'l 
dn^ 

=  { E[(x/p  i2)(dp  j/djt)2-(x/p  1  )(d2p  j/d7t2)+(y/p22)(dp2/d7t)2-(y/p2)(d2p2/d7c2) 

■*’(z/P3^)(dp3/d7r)2- (z/p3)(d2p3/d7t2)  j-1  (23) 

where  pj,  P2,  and  P3  are  given  in  (4). 

The  solution  to  equation  (22)  and  (23)  are: 

TtR  =  1  -  { [-b  +  (b2  -4ac)V2]/(2a)}  1/k  (24) 

where  a  =  -m(q+6-l)[(l-0)(l-6')-qq'][(l-0)0'-(l-q')q] 

b  =  x[ii(i-Ti')-6'(i-6)][(i-0)(i-0')(i-n)-qn'0]  +  y{(i-e)(i-e')-nn'][n0(i-n') 

-0'(i-0)(i-ri)]  -mii(n+6-i)[(i-ii')[(i-e)(i-0')-nTi']  +  n'[(i-e)0'-n(i-n')] 
c  =  Ti{x(i-n')[nn'0-(i-0)(i-0')(i-n)]  +  yn'[n(i-n')0-0'(i-0)(i-n)]  -mtin'(i-n')(n+0-i)l 
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aVar(7rR)  =  {mk2(l-7r)2k-2[(^^-.(1.0)(l.e-))2/(^^-.(^^-.(,.e)(l.e'))(i.rt)k)  + 

(nd-n')  -e'(i-e))2/(n(i-n')-(n(i-n')  -e'(i-em-K)h  + 

(i-n-e)2/((i-ti)  -(i-n-e)(i-K)k)]}-i  (25) 

For  the  trinomial  distribution  it  is  known  that  the  MLE  is  admissible  with  respect  to  the  square  error 
loss  [Alam(1979)],  hence  is  also  admissible. 

The  proportion  Pj^  of  equipment  being  defective  on  the  improved  stress  screening  test  can  be  estimated 
by  replacing  Jt{^  for  k  in  (8).  The  solution  is 

PR(equipment)  =  (1  -  (l-7tR)^)q' +  (1  -  7tR)^l-0').  (26) 

The  estimates  of  the  proportion  of  part  Pj^(part)  and  manufacturing  Pj^(manufacturing  defect)  defects, 
given  that  the  equipment  is  defective  on  the  improved  stress  screening  test  can  be  obtained  by  the 
following  equations: 

PR(part  defect)  =  (1  -  (l-7CR)k)q7[  (1  -  (l-7:R)k)q'+  (1  -  icr)^  (1-0')].  (27) 

PR(manufacturing  defect)  =  (I-kr)^  (1-0)(1-0')/[  (1  -  (l-nR)k)q'q  +  (1  -  nR)k(l-0)(l-0')].  (28) 

We  will  assume  q'“l,  0'=l  to  do  the  further  analysis  for  the  properties  of  the  refined  estimators.  Under 
this  assumption,  an  estimator  7Cr|  and  its  asymptotic  variance  aVar(7tR|)  are  obtained  as  following: 

7CR1  =  1  -  {  [-(m+y)(l-n)+(m-x)0]/2m(n+0-l)I  -  l[[(m-x)0-(m-y)(l-r|)]^+4xy(l-q)0]/ 
[2m(n+0-l)]|(/2}l/k  (29) 

aVar(7CRi)  =  -  {  E  [  d^logL(x.v.zlE)  ] 

dji2 

=  { E[(x/p  1 2)(dp  i/dj:)2-(x/p  ( )(d2p  i/dic2)+(y/p22)(dp2/d7t)2-(y/p2)(d2p2/d7t2) 
+(z/P3^)(dp3/d7i:)2  -  (z/p3)(d2p3/d7c2)  )-• 


{ mk(  l-Jc)*'-2[(q [(k- 1 )+( l-7i)k])/(  l-(  I -7t)k)  +  ( 1  -0)  -  [(n+0- 1  )[(k- 1)(  l-q)- 

(n+0-i)(i-ji)’']]/[i-n+(n+e-i)(i-n)‘'ir* 
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A  simpler,  moment  kind  of  estimator,  tcj^  =  x/kmq,  can  be  obtained  by  noting  that 
E(x|m)  =  solving  for  k,  and  taking  the  first  tenn  of  the  MacLaurin  series  of 

7c  =  1  -  ( 1  -  (x/nm))  expanded  about  x=0.  The  expected  value,  bias  and  variance  of  Ttj^  are 

E(ttR2)  =  [l-d-irl'^l/k, 

Biasfttj^)  =  [l-krt-(l-7t)^^]/k, 

Var(7tR2)  =  [l-(l-7t)*^]  [l-n(l-(l-ic)^)]/(k2mn). 

In  Table  III,  values  of  (aVarfnf^))^/^  are  listed  for  m  =  100,  (r)  =  .99  ,6  =  .90,  ri'=.95,  and  6'  =.95), 
and  (q  =  .99,  0  =  .99,  q'  =.95,  and  0'  =  .95).  Values  of  the  root  mean  square  error(RMSE)  of  and 
(aVar(7Cj^j))^^^  are  listed  for  m  =  100,  (q  =  .99  ,0  =  .90),  and  (q  =  .99  ,0  =  .99).  We  note  that  the 
PMSE  associated  with  simple  estimator  Ttj^  is  larger  then  the  standard  error  for  the  MLE  for  large 
values  of  ti.  However,  for  smaller  re  the  simple  estimator  is  quite  good. 


IV.  Numerical  Example 

Consider  a  random  sample  of  100  equipments  with  16  independent  identical  parts  in  each  equipment. 
We  will  use  the  proposed  procedure  to  provide  the  initial  and  refined  estimates  of :  (1)  part  defect  density, 
(2)  equipment  defect  density,  and  (3)  proportion  of  parts  and  manufacturing  defects.  Suppose  the 
expected  life  time  of  a  good  part  is  150  times  the  base  line  screening  duration,  and  the  base  line  screening 
performs  according  to  the  following  factors: 

(i)  Temperature  cycling  screening  is  performed  15  cycles  at  20^C/min  over  a  lOO^^C  range. 

(ii)  Functional  and  parametric  type  of  testing  is  perfonned  with  detection  value  =  .99. 

(iii)  Testing  performed  concurrently  with  stress;  i.e.  detection  value  is  1.0. 

(iv)  Probability  of  detecting,  isolating  and  removing  the  defect  is  1.0. 

Then,  precipitation  efficiency  =  1  -  exp[-(.0017(100+.6)-^(ln(20+2.718)]^(15)] 

=  .99999 

sensitivity  =  q  =  .99999  x  .99  x  1  x  1  =  .99 
specificity  =  0  =  (150t/(150t+t))^^  =  .90 
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Assume  30  equipments  are  observed  defective,  then  from  (13),  (16),  (18)  and  (19),  we  obtain: 
the  initial  estimate  of  part  defect  density  =  .0157822. 
the  initial  estimate  of  equipment  defect  density  Pg|  (equipment)  =  .30. 
the  initial  estimate  of  proportion  of  part  defect  density  Pg((part  defect)  =  .74. 
the  initial  estimate  of  proportion  of  manufacturing  defect  density 
Pgj  (manufacturing  defect)  =  .26. 

If  5  of  the  30  defective  equipments  were  observed  during  the  first  4  cycles,  then  an  improved 
environmental  stress  screening  level  will  be  generated  with  new  precipitation  efficiency  .96293,  new 
sensitivity  .95,  and  new  specificity  .96.  From  (24),(26),(27)  and  (28),  we  have: 
the  refined  estimate  of  part  defect  density  itg  =  .0182053 
the  refined  estimate  of  equipment  defect  density  Pg(equipmcnt)  =  .28. 
the  refined  estimate  of  proportion  of  part  defect  density  Pg(part  defect)  =  .86. 
the  refined  estimate  of  equipment  defect  density  Pg(eqiiipment)  =  .14. 

The  proposed  procedure  allows  the  characterization  of  defect  density  to  be  expressed  as  a  function  of 
environmental  stress  screening  level.  It  will  be  verified  by  the  engineers  at  Rome  Laboratory,  then 
incorporated  into  the  appropriate  documents. 
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TABLE  I 


Initial  Estimate  when  m=  100,  x\=.99,  and  0=.9O; 


b 

k=l 

k=2 

k=4 

k=8 

k=16 

k=32 

k=64 

k=72 

k=100 

11 

.011236 

.005633 

.002820 

.001411 

.000705 

.000353 

.000176 

.000156 

.000113 

12 

.022471 

.011299 

.005665 

.002837 

.001419 

.000710 

.000355 

.000315 

.000227 

13 

.033707 

.016998 

.008535 

.004276 

.002140 

.001070 

.000535 

.000476 

.000342 

14 

.044943 

.022730 

.011430 

.005731 

.002869 

.001436 

.000718 

.000638 

.000459 

15 

.056179 

.028495 

.014350 

.007201 

.003607 

.001805 

.000903 

.000802 

.000578 

16 

.067415 

.034295 

.017297 

.008686 

.004352 

.002178 

.001089 

.000968 

.000697 

17 

.078651 

.040131 

.020271 

.010187 

.005106 

.002556 

.001279 

.001137 

.000818 

18 

.089887 

.046001 

.023271 

.011704 

.005869 

.002939 

.001470 

.001307 

.000941 

19 

.101124 

.051909 

.026300 

.013237 

.006640 

.003326 

.001664 

.001479 

.001065 

20 

.112360 

.057853 

.029357 

.014788 

.007421 

.003717 

.001860 

.001654 

.001191 

21 

.123595 

.063835 

.032443 

.016355 

.008211 

.004114 

.002059 

.001830 

.001318 

22 

.134831 

.069855 

.035560 

.017941 

.009011 

.004515 

.002260 

.002009 

.001447 

23 

.146067 

.075915 

.038706 

.019544 

.009820 

.004922 

.002464 

.002190 

.001577 

24 

.157303 

.082014 

.041884 

.021166 

.010639 

.005334 

.002670 

.002374 

.001710 

25 

.168539 

.088155 

.045094 

.022807 

.011469 

.005751 

.002879 

.002560 

.001843 

26 

.179775 

.094337 

.048336 

.024467 

.012309 

.006173 

.003091 

.002748 

.001979 

27 

.191011 

.100562 

.051612 

.026148 

.013160 

.006602 

.003306 

.002939 

.002117 

28 

.202247 

.106830 

.054923 

.027849 

.014023 

.007036 

.003524 

.003133 

.002256 

29 

.213483 

.113142 

.058268 

.029571 

.014896 

.007476 

.003745 

.003329 

.002398 

30 

.224719 

.119500 

.061650 

.031315 

.015782 

.007922 

.003969 

.003528 

.002542 

31 

.235955 

.125903 

.065068 

.033081 

.016679 

.008374 

.004196 

.003730 

.002687 

32 

.247191 

.132354 

.068525 

.034870 

.017589 

.008834 

.004426 

.003935 

.002835 

33 

.258427 

.138854 

.072020 

.036683 

.018512 

.009299 

.004660 

.004143 

.002985 

34 

.269663 

.145402 

.075555 

.038519 

.019448 

.009772 

.004898 

.004355 

.003137 

35 

.280889 

.152002 

.079131 

.040381 

.020398 

.010251 

.005139 

.004569 

.003292 

36 

.292135 

.158653 

.082750 

.042268 

.021362 

.010738 

.005383 

.004787 

.003449 

37 

.303371 

.165357 

.086412 

.044182 

.022340 

.011233 

.005632 

.005008 

.003608 

38 

.314607 

.172115 

.090118 

.046122 

.023333 

.011735 

.005885 

.005232 

.003770 

39 

.325843 

.178929 

.093870 

.048091 

.024342 

.012246 

.006141 

.005461 

.003935 

40 

.337079 

.185800 

.097669 

.050089 

.025366 

.012764 

.006402 

.005693 

.004102 

41 

.348315 

.192730 

.101518 

.052116 

.026407 

.013291 

.006668 

.005929 

.004272 

42 

.359550 

.199719 

.105416 

.054175 

.027464 

.013828 

.006938 

.006169 

.004445 

43 

.370786 

.206770 

.109366 

.056265 

.028540 

.014373 

.007212 

.006413 

.004622 

44 

.382022 

.213885 

.113368 

.058388 

.029633 

.014928 

.007492 

.006662 

.004801 

45 

.393258 

.221064 

.117426 

.060546 

.030745 

.015492 

.007776 

.006915 

.004984 

46 

.404494 

.228310 

.121541 

.062738 

.031877 

.016067 

.008066 

.007173 

.005170 

47 

.415730 

.235625 

.125714 

.064967 

.033029 

.016653 

.008361 

.007435 

.005359 

48 

.426966 

.243010 

.129948 

.067234 

.034202 

.017249 

.008662 

.007703 

.005552 

49 

.438202 

.250468 

.134245 

.069540 

.035396 

.017857 

.008969 

.007976 

.005749 

50 

.449438 

.258001 

.138607 

.071887 

.036613 

.018477 

.009281 

.008254 

.005950 

51 

.460674 

.265612 

.143036 

.074276 

.037854 

.019100 

.009601 

.008538 

.006155 

52 

.471910 

.273302 

.147534 

.076709 

.039119 

.019755 

.009926 

.008828 

.006364 

53 

.483146 

.281074 

.152105 

.079188 

.040410 

.020413 

.010259 

.009124 

.006578 

54 

.494382 

.288932 

.156751 

.081714 

.041727 

.021086 

.010599 

.009427 

.006796 

55 

.505618 

.296887 

.161476 

.084290 

.043072 

.021773 

.010946 

.009736 

.007019 
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b 

k=l 

k=2 

k=4 

k=8 

k=16 

k=32 

k=64 

k=72 

k=100 

56 

.516854 

.304913 

.166281 

.086917 

.044467 

.022476 

.011301 

.010052 

.007247 

57 

.528090 

.313043 

.171171 

.089599 

.045851 

.023194 

.011665 

.010375 

.007481 

58 

.539326 

.321270 

.176149 

.092337 

.047286 

.023929 

.012037 

.010707 

.007720 

59 

.550562 

.329598 

.181219 

.095135 

.048756 

.024682 

.012809 

.011046 

.007965 

60 

.561798 

.338032 

.186386 

.097994 

.050260 

.025454 

.013209 

.011394 

.008216 

61 

.573034 

.346573 

.191652 

.100918 

.051800 

.026244 

.013209 

.011750 

.008474 

62 

.584270 

.355228 

.197023 

.103910 

.053379 

.027056 

.013620 

.012116 

.008738 

63 

.595506 

.364001 

.202505 

.106974 

.054999 

.027888 

.014042 

.012492 

.009010 

64 

.606742 

.372897 

.208102 

.110113 

.056661 

.028744 

.014476 

.012878 

.009289 

65 

.617977 

.381920 

.213820 

.113332 

.058369 

.029623 

.014923 

.013276 

.009576 

66 

.629214 

.391078 

.219665 

.116634 

.060124 

.030528 

.015382 

.013685 

.009872 

67 

.640449 

.400375 

.225645 

.120026 

.061930 

.031460 

.015855 

.014106 

.010176 

68 

.651685 

.409818 

.231767 

.123511 

.063790 

.032420 

.016343 

.014541 

.010491 

69 

.662921 

.419415 

.238039 

.127096 

.065706 

.033411 

.016847 

.014989 

.010815 

70 

.674157 

.429174 

.244469 

.130787 

.067684 

.034435 

.017368 

.015453 

.011150 

71 

.685393 

.439102 

.251069 

.134592 

.069726 

.035493 

.017907 

.015933 

.011497 

72 

.696629 

.449209 

.257847 

.138517 

.071838 

.036588 

.018464 

.016430 

.011857 

73 

.707865 

.459509 

.264816 

.142572 

.074025 

.037724 

.019043 

.016945 

.012230 

74 

.719101 

.470001 

.271990 

.146765 

.076292 

.038903 

.019644 

.017481 

.012617 

75 

.730337 

.480709 

.279382 

.151108 

.078646 

.040128 

.020269 

.018037 

.013020 

76 

.741573 

.491643 

.287008 

.155612 

.081094 

.041404 

.020920 

.018618 

.013440 

77 

.752809 

.502817 

.284888 

.160290 

.083643 

.042734 

.021600 

.019223 

.013878 

78 

.764045 

.514248 

.303041 

.165159 

.086303 

.044125 

.022311 

.019857 

.014337 

79 

.775281 

.525955 

.311490 

.170235 

.089085 

.045581 

.023056 

.020521 

.014818 

80 

.786517 

.537958 

.320263 

.175538 

.092001 

.047110 

.023839 

.021218 

.015323 

81 

.797753 

.550281 

.329389 

.181092 

.095064 

.048719 

.024663 

.021953 

.015855 

82 

.808989 

.562952 

.338904 

.186922 

.098291 

.050416 

.025534 

.022729 

.016418 

83 

.820225 

.576001 

.348848 

.193060 

.101702 

.052214 

.026457 

.023552 

.017014 

84 

.831461 

.589465 

.359270 

.199544 

.105318 

.054123 

.027438 

.024427 

.017648 

85 

.842697 

.603385 

.370226 

.206417 

.109167 

.056160 

.028486 

.025361 

.018325 

86 

.853933 

.617812 

.381787 

.213735 

.113284 

.058343 

.029610 

.026364 

.019053 

87 

.865169 

.632806 

.394035 

.221562 

.117709 

.060696 

.030823 

.027445 

.019837 

88 

.876404 

.648438 

.407074 

.229983 

.122494 

.063247 

.032140 

.028620 

.020690 

89 

.887640 

.664799 

.421035 

.239102 

.127706 

.066032 

.033580 

.029905 

.021623 

90 

.898876 

.682001 

.436086 

.249058 

.133431 

.069102 

.035170 

.031324 

.022653 

91 

.910112 

.700187 

.452448 

.260033 

.139787 

.072523 

.036944 

.032907 

.023804 

92 

.921348 

.719551 

.470426 

.272281 

.146936 

.076385 

.038951 

.034699 

.025106 

93 

.932584 

.740355 

.490446 

.286169 

.155115 

.080823 

.041263 

.036763 

.026608 

94 

.943820 

.762977 

.513150 

.302254 

.164688 

.086045 

.043990 

.039199 

.028381 

95 

.955056 

.788000 

.539566 

.321447 

.176257 

.092397 

.047318 

.042173 

.030547 

96 

.966296 

.816403 

.571518 

.345414 

.190935 

.100520 

.051590 

.045992 

.033332 

97 

.977528 

.850094 

.612823 

.377764 

.211181 

.111845 

.057580 

.051349 

.037243 

98 

.988764 

.894000 

.674424 

.429407 

.244624 

.130876 

.067732 

.060438 

.043893 
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TABLE  II 


Comparison  of  errors  associated  with  initial  estimators  when  m=100: 


TT 

k 

RMSE(7ri2)  (aVar(7iii))l/2  RMSE(jti2) 
il=.99,  e=.90  n=.95 

(aVar(7:ji))^/^  RMSE(icj2) 

,  6=.85  ri= 

(aVar(;cjj))^/2 
=.99,  0=.99 

.2 

1 

.050338 

.050338 

.057811 

.057811 

.041268 

.041268 

2 

.040459 

.034664 

.047886 

.038761 

.032148 

.030663 

4 

.065978 

.026554 

.074147 

.029590 

.054959 

.024518 

8 

.105932 

.024509 

.112164 

.028883 

.097015 

.023079 

16 

.145351  ■ 

.036706 

.149131 

.057585 

.139843 

.034473 

.1 

1 

.043989 

.043989 

.052604 

.052604 

.031671 

.031671 

2 

.026057 

.027683 

.031083 

.031883 

.020893 

.022512 

4 

.025134 

.018923 

.030325 

.021191 

.018998 

.016657 

8 

.036288 

.014342 

.040827 

.015965 

.030102 

.013209 

16 

.054294 

.012957 

.057542 

.015122 

.049629 

.012190 

.08 

1 

.042324 

.042324 

.051265 

.051265 

.028966 

.028966 

2 

.024219 

.025956 

.028764 

.030260 

.018979 

.020358 

4 

.019909 

.017233 

.024329 

.019448 

.014981 

.014834 

8 

.025966 

.012560 

.029954 

.01 1980 

.020684 

.011428 

16 

.038754 

.010541 

.041738 

.015913 

.034514 

.009867 

32 

.053940 

.012016 

.055811 

.084269 

.051211 

.011339 

.05 

1 

.039505 

.039505 

.049037 

.049037 

.024043 

.024043 

2 

.021400 

.023047 

.025675 

.027601 

.015670 

.016501 

4 

.014098 

.014460 

.017327 

.016677 

.010735 

.011714 

8 

.013781 

.009850 

.016721 

.011040 

.010238 

.008651 

16 

.018912 

.007431 

.021289 

.008268 

.015659 

.006835 

32 

.027466 

.006649 

.029112 

.007733 

.025083 

.006253 

64 

.036543 

.009056 

.037519 

.013416 

.035113 

.008523 

.02 

1 

.036221 

.036221 

.046510 

.046510 

.017294 

.017294 

2 

.018407 

.019604 

.022794 

.024590 

.010942 

.011217 

4 

.010211 

.011186 

.012562 

.013573 

.007140 

.007602 

8 

.006501 

.006837 

.008111 

.007984 

.004834 

.005334 

16 

.005587 

.004522 

.006920 

.005109 

.004060 

.003879 

32 

.006928 

.003280 

.007999 

.003646 

.005499 

.002979 

64 

.009899 

.002729 

.010667 

.003094 

.008805 

.002551 

.01 

2 

.017280 

.018287 

.021818 

.023485 

.008590 

.008720 

4 

.009076 

.009849 

.011330 

.012414 

.005449 

.005654 

8 

.005121 

.005637 

.006377 

.006849 

.003565 

.003830 

16 

.003299 

.003447 

.004150 

.004027 

.002427 

.002687 

32 

.002844 

.002278 

.003529 

.002575 

.002057 

.001954 

64 

.003499 

.001651 

.004040 

.001836 

.002776 

.001499 

.001 

16 

.002064 

.002266 

.002642 

.002927 

.000997 

.001021 

32 

.001094 

.001210 

.001388 

.001531 

.000625 

.000651 

64 

.000610 

.000677 

.000769 

.000831 

.000406 

.000435 

72 

.000556 

.000617 

.000701 

.000753 

.000379 

.000408 

100 

.000439 

.000481 

.000554 

.000577 

.000312 

.000342 

128 

.000376 

.000403 

.000475 
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.000477 

.000272 

.000302 

TABLE  III 

Comparison  of  errors  associated  with  refine  estimators  when  m=  100: 

7t  k  (aVar(r[^))*/^  (aVar(jij^j))*/^  RMSE(itj^)  (aVar(nj^))*/^  (aVar(jrj^l))^/^  RMSE(7C^) 


.01 


i1=.99, 6=.90 

ri=.99,  e=.90 

il=.99,  e=.90 

n=.99,  e=.99 

il=.99,  e=.99. 

ti=.99, 0=.99 

n'=.95,e'=.95 

n'=i.o,e'=i.o 

Ti'=  1.0,6'=  1.0 

n'=.95,0'=.95 

n'=  1.0,0'=  1.0 

ri'=  1.0,0'=  1.0 

1 

.041970 

.040251 

.040251 

.040863 

.040251 

.040251 

2 

.031433 

.030235 

.031386 

.030564 

.030235 

.031386 

4 

.025331 

.024305 

.053857 

.024495 

.024305 

.053857 

8 

.024083 

.022929 

.096092 

.023074 

.022932 

.096092 

16 

.036568 

.034079 

.139264 

.034472 

.034232 

.139264 

1 

.031814 

.030167 

.030167 

.030794 

.030167 

.030167 

2 

.022878 

.021929 

.020360 

.022272 

.021930 

.020360 

4 

.017965 

.016414 

.018436 

.016598 

.016414 

.018436 

8 

.013638 

.013090 

.029481 

.013195 

.013091 

.029481 

16 

.012709 

.012112 

.049144 

.012187 

.012111 

.049144 

1 

.028940 

.027277 

.027277 

.027894 

.027277 

.027277 

2 

.020619 

.019712 

.018415 

.020053 

.019712 

.018415 

4 

.015158 

.014573 

.014544 

.014756 

.014573 

.014544 

8 

.011766 

.011309 

.020164 

.011410 

.011309 

.020164 

16 

.010250 

.009798 

.034077 

.009863 

.009798 

.034077 

32 

.011923 

.011259 

.050925 

.011338 

.011269 

.050925 

1 

.023668 

.021910 

.021910 

.022495 

.021910 

.021910 

2 

.016566 

.015699 

.014966 

.016028 

.015699 

.014966 

4 

.011900 

.011404 

.010428 

.011584 

.011404 

.010428 

8 

.008860 

.008522 

.009912 

.008619 

.008522 

.009912 

16 

.007054 

.006772 

.015332 

.006827 

.006773 

.015332 

32 

.006516 

.006211 

.024835 

.006251 

.006212 

.024835 

64 

.009013 

.008444 

.034962 

.008522 

.008466 

.034962 

1 

.016258 

.014072 

.014072 

.014597 

.014072 

.014072 

2 

.010939 

.010002 

.009804 

.010291 

.010002 

.009804 

4 

.007589 

.007146 

.006752 

.007309 

.007146 

.006752 

8 

.005400 

.005160 

.004678 

.005251 

.005160 

.004678 

16 

.003963 

.003809 

.003925 

.003858 

.003809 

.003925 

32 

.003066 

.002947 

.005358 

.002974 

.002947 

.005358 

64 

.002651 

.002534 

.008692 

.002552 

.002535 

.008692 

2 

.008195 

.007089 

.007018 

.007355 

.007089 

.007018 

4 

.005512 

.005039 

.004891 

.005185 

.005039 

.004891 

8 

.003823 

.003600 

.003373 

.003682 

.003600 

.003373 

16 

.002720 

.002599 

.002347 

.002645 

.002599 

.002347 

32 

.001996 

.001918 

.001987 

.001943 

.001918 

.001987 

64 

.001543 

.001483 

.002704 

.001497 

.001483 

.002704 

16 

.000945 

.000797 

.000783 

.000830 

.000797 

.000783 

32 

.000628 

.000566 

.000548 

.000583 

.000566 

.000628 

64 

.000431 

.000403 

.000380 

.000413 

.000403 

.000431 

72 

.000406 

.000381 

.000356 

.000390 

.000381 

.000356 

100 

.000344 

.000325 

.000299 

.000332 

.000325 

.000299 

128 

.000304 

.000290 

.000262 

.000295 

.000290 

.000262 
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Characterization  of  Pure  and  Doped  Bij^SiO^o 


Wallace  B.  Leigh, 

Assistant  Professor,  Division  of  Electrical  Engineering, 
Alfred  University 


Abstract 

This  reports  describes  research  performed  at  Rome  Laboratories,  Hanscom  AFB,  MA  under  a  summer 
research  program.  Two  separate  areas  of  research  were  investigated.  The  first  concerned  defect  studies  of 
pure  and  doped  BiuSiO^g  (BSO)  grown  at  Rome,  and  the  characterization  of  this  material  by  way  of 
photoconductivity  and  optical  absorption  measurements.  This  is  part  of  an  ongoing  effort  to  understand  the 
photorefractive  properties  of  BSO,  particularly  the  photorefractive  response  at  longer  (larger  than  500nm) 
wavelengths  and  how  to  enhance  this  response.  Photoconductivity  measurements  were  made  on  undoped 
BSO  grown  by  Czochralski  (Cz)  and  Hydrothermal  transported  material,  as  well  as  material  doped  with 
various  impurities.  Undoped  Cz  BSO  exhibited  excellent  photoconductive  response  under  a  wide  spectral 
range.  A  peak  was  observed  in  the  photoconductivity  spectrum,  identifying  a  resonance  absorption  at 
5  lOnm.  Hydrothermal  material,  however,  exhibited  only  minor  photoconductive  response  onl\'  at  a 
wavelength  corresponding  to  the  bandgap  energy  of  BSO.  BSO  samples  doped  with  transition  metals  were 
observed  to  lose  the  majority  of  their  photoconductive  response  as  the  impurity  apparently  kills  photo¬ 
conductivity.  An  attempt  was  made  to  change  the  charge  state  of  transition  metals  by  irradiation  in  an 
effort  to  revive  the  photoconductivity.  Various  degrees  of  success  in  enhancing  photoconductivity  in  the 
red  region  of  the  spectrum  was  met  by  doping  with  A1  and  B,  and  co-doping  these  samples  with  V  and  Mn. 
Vastly  different  photoconductivity  results  were  obtained  between  Cz  and  Hydrothermal  material  doped 
with  the  same  impurity.  Various  photoconductive  quenching  was  observed  in  many  of  the  samples, 
particularly  in  the  doped  hydrothermal  material.  Bii^TiO,,,  (BTO)  material  transported  by  the 
hydrothermal  technique  was  observed  to  have  excellent  photoconductive  response,  unlike  hydrothermal 
BSO.  The  second  area  of  research  investigated  at  Rome  under  this  program  involved  the  stud\’  of  optical 
waveguides  in  BSO.  The  refractive  index  of  BSO  grown  by  various  techmques  was  measured,  and  the 
fabrication  of  waveguides  using  ion  implantation  and  liquid  phase  epitaxy  was  investigated. 
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Characterization  of  Pure  and  Doped  Bij^SiO^o 


Wallace  B.  Leigh, 

I.  Introduction 

Photorefraction  is  a  name  given  to  a  third-order  nonlinear  optical  effect  observable  in  certain 
insulators  and  semi-insulators.  Photorefraction  is  a  method  of  writing  volume  holograms  which  are 
dynamic;  that  is,  volume  gratings  which  erase  after  cessation  of  the  writing  beams.  Time  constants  of  the 
gratings  may  range  from  several  milliseconds  to  several  months.  Thus  photorefracting  crystals  can  be  used 
for  a  variety  of  suitable  applications  of  non-linear  mixing  including  active  spatial  filtering,  phase 
conjugation  by  way  of  degenerate  four-wave  mixing,  optical  processing  devices  such  as  optical  oscillators, 
3iid  as  a  t6chxiic|U6  useful  for  non-destructive  materials  analysis. 

In  this  study,  photorefraction  and  general  defect  properties  were  observed  in  cubic  Bij2Si02o 
(BSO).  Introductory  studies  of  BSO  on  single  crystals  grown  by  Czochralski  and  by  the  hydrothermal 
method  are  a  part  of  a  large  effort  to  relate  defect  properties  of  BSO  to  its  photoreffactive  behavior.' 
Emphasis  of  the  study  is  to  isolate  impurity/defect  phenomena  in  respect  to  photoreffactive  quality  and 

demonstrate  the  photoreffactive  technique  as  a  method  of  nondestructive  and  contactless  testing  of  this 
material. 

Photorefraction  is  unique  in  that  it  is  a  nonlinear  effect  the  sensitivity  of  which  does  not  depend  on 
the  absolute  intensity  of  the  laser  which  writes  the  grating.  Thus  the  technique  should  be  observable  with 
small  (less  than  lOOmW)  power  lasers.  In,  fact,  what  would  be  most  desirable  would  be  the  ability  to  tailor 
a  crystal  for  any  desirable  photorefraction  application,  for  example  applications  involving  low-power  red  or 
infrared  sources.  The  advantages  of  using  these  sources  for  photorefraction  applications  are  obwous.  in 
terms  of  cost  and  ease  of  portability  of  the  device. 

Dopmg  studies  at  Rome  Laboratories  of  BSO  were  introduced  as  a  method  of  altering  the  spectral 
dependence  of  absorption  and  photoconductivity.  This  is  especially  desirable  for  fabricating 

photoreffactive  samples  from  hydrothermal  BSO,  as  undoped  hydrothermal  material  is  not  normally 
photorefracting. 

There  is  also  an  mterest  m  preparing  waveguides  in  photorefracting  materials.  Photorefracting 
waveguides  are  easier  to  mterface  mto  certain  optoelectronic  applications  than  bulk  photorefracting 
devices.  An  example  would  be  an  application  involving  light-guided  fiber  optics.  The  speed  of 
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photorefractive  response  should  also  be  higher  in  photorefractive  waveguides  due  to  the  higher  optical  field 
density'  for  the  same  optical  power. 

n.  Photoconductivity 

An  optimum  photoreffactor  should  be  fairly  transparent  at  the  wavelength  of  interest,  to  allow  for 
the  grating  to  be  written  in  the  full  volume  of  the  material,  but  at  the  same  time  not  too  transparent  as 
phoconductivity  is  also  dependent  on  the  absorption.  There  is  a  trade-off.  therefore  to  make  use  of  extrinsic 
photoconductivity  with  the  proper  amount  of  absorption  without  severely  attenuating  the  optical  beam  as 
it  passes  through  the  sample.  To  examine  an  elective  sample  for  photorefraction  requires  a  study  of  both 
the  spectral  dependence  of  absorption  and  photoconductivity. 

Ohmic  contacts  for  TSC  and  photoconductivity  measurements  were  made  by  applying  indium 
strips  to  the  polished  surfaces  of  samples  1  cm  on  a  side.  Samples  were  heated  in  air  at  150°C  for  15 
minutes  to  alloy  the  contacts.  Samples  were  then  mounted  on  a  sample  holder  kept  at  room  temperature. 
Very  low  (typically  100  V/cm)  electric  fields  were  used  in  TSC  and  photoconductivity  to  minimize  the 
influence  of  internal  space  charge  generated  electric  fields  on  the  electrical  measurements. 

Spectral  photoconductiviU'  and  absorption  measurements  were  made  with  a  1/4  m  monochrometer 
with  a  2  nm  bandpass,  and  a  lOOOW  Xenon  arc  lamp  as  a  source.  Spectral  measurements  were  made  from 
long  to  short  and  from  short  to  long  wavelengths  to  ensure  no  photochromatic  transitions  were  taking  place. 
Figure  1  shows  spectral  dependence  of  the  DC  photoconductivity  for  two  crystals  grown  by  the  different 
processes.  In  both  cases,  the  photocurrent  flux  has  been  normalized  to  the  incident  light  power  density 
which  averaged  ~5mW/cm^  The  Cz  sample  exhibits  photoconductivity  over  a  wide  range,  from  600nm 
to  well  over  the  band  edge,  which  contributes  to  useful  photorefraction  over  a  wide  range  of  pumping 
source  wavelengths.  A  peak  is  observed  in  the  DC  photoconductivity  for  all  undoped  Cz  samples, 
illustrating  perhaps,  an  energy-dependent  capture  cross-section.  In  each  case  the  peak,  which  represents  an 
absorption  resonance,  is  centered  close  to  5  lOnm.  The  hydrothermal  material  is  only  slightly 
photoconducting  at  an  energy  which,  from  absorption  data,  is  associated  with  the  band-to-band  transition. 
Even  under  intrinsic  illumination,  the  photoconductivity  of  the  hydrothermal  material  is  still  two  orders  of 
magnitude  smaller  than  for  Cz  grown  BSD. 

We  have  also  observed  that  not  only  the  optically  induced  conductivity  but  also  the  dark 
conductivity  is  different  in  hydrothermal  and  Cz  material.  BSD  is  known  to  possess  thermally  activated 
current  instabilities  due  to  charge  injection  from  contacts".  These  instabilities  are  most  often  observed  at 
temperatures  above  100°C  or  at  high  applied  fields.  We  have  also  observed  a  thermally  activated  dark 
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conductivity  in  both  Cz  and  hydrothermal  BSO  with  an  activation  energy  of  1 .2  eV  for  both  material 

types.  The  thermally  activated  dark  conductivity  is,  however,  three  orders  of  magnitude  smaller  in 
hydrothermal  as  in  Cz  BSO. 

We  observed  a^ficantly  smaller  optical- and  thermal-induced  conductivities  in  the  hydrothermal 
material.  A  lower  defect  content  also  appears  to  inhibit  the  conduction  processes  for  the  hydrothermal 
material,  in  terms  of  missing  traps  responsible  for  a  hopping  conduction  process.  A  hopping  conduction 
process  has  been  proposed  as  a  conduction  mechanism  in  BSO'  ^  The  hopping  model  assumes 
conduction  takes  place  via  a  light  induced  mechanism  from  a  filled  to  an  empty  neighboring  trap  rather 

via  band  transport  from  a  photoexcited  carrier.  The  identity  of  the  traps  involved  in  hopping  conduction 
in  BSO  is  not  yet  known. 


in.  Photoconductivity  of  doped  BSO  and  BTO. 

Photoconductivity  measurements  were  also  undertaken  to  measure  the  spectral  response  of  the 
photo-induced  current  of  BSO  doped  with  a  variety  of  impurities.  Hie  purpose  here  is  to  investigate  the 
possibility  of  changing  the  shape  of  the  photoconductivity  spectral  response  curve  and  enhancing  the 
photoconductivity  in  specific  areas  of  the  response  curve.  Table  I  shows  doped  samples  investigated  for 
photoconductive  response.  Cz-BSO  co-doped  with  Mn  and  A1  appeared  green  in  color.  Sample  used  was 
full  of  veils  but  both  sides  were  polished.  Samples  of  Cz  BSO  doped  only  with  Mn  were  reddish  brown 
and  clear,  although  not  very  transparent.  Iron  compound  BFO  is  black  and  has  shown  no  photoconductiviy. 


Table  I.  Doped  BSO  used  in 
photoconductivity  analysis. 

Sample  Impurity 

BSO  110  none 

BSO  160  Mn+Al 

BSO  168  Mn  only 

BS45  Cr 

(hydrothermal) 
BSO  128  Al+Cr 

BSO  127  La 

BSO  150  A1 


Table  I.  (cont) 

BSO  183 

V+B 

BS0139 

V+Al 

BSO  55 

none 

(from  hydro  charge) 

BT2 

Hydrothermal  BTO 

Ge  implanted  BSO 

BFO 

iron  compound 
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Results 


In  all  cases,  doping  BSO  reduced  the  overall  magnitude  of  the  photoconductivity,  as  could  be 
expected.  Figure  2  shows  results  of  selected  samples  of  Cz  BSO  containii^  various  dopants  and 
co-dopants.  Samples  of  undoped  Cz  BSO  showed  excellent  photoconductivity ,  while  those  doped  with 
transition  metals  show  that  doping  has  killed  the  pc  in  that  material.  A  weak  photoconductive  shoulder 
appears  in  the  CZ-BSO:Al,Mn  sample.  It  is  difficult  to  compare  absolute  scales  between  various  samples, 
however,  it  appears  that  at  least  the  A1  and  La  doped  material  should  be  investigated  again  for 
photoreffaction  using  red  pump  beams.  Of  the  other  samples  in  Table  I,  each  exhibited  only  extremely 
weak  photoconductivity. 

Of  the  the  samples,  the  co-doped  Al-Mn  sample  appears  to  have  the  highest  resistivity  and  is 
difficult  to  make  ohmic  contacts  to.  An  effort  may  be  undertaken  to  improve  contacts  to  this  material. 

The  four  doped  samples  of  Fig.  2  have  been  sent  to  be  irradiated  and  will  be  irradiated  with  I  OK 
initially  of  gamma  rays  and  stored  in  dry  ice.  While  irradiating  samples  does  not  always  increase  the 
photorefi^tion  efficiency  at  longer  wavelengths.  It  does  not  effect  the  efficiencies  at  shorter  wavelengths 
and  speeds  of  er^ure  can  be  enhanced  for  erasure  by  a  long-wavelength  probe. 

Samples  of  BTO  grown  by  hydrothermal  transport,  while  not  suitable  for  transmission 
measurements,  could  be  used  for  photoconductivity  analysis.  Figure  3  shows  a  PC  scan  of  a  hydrothermal 
BTO  sample.  It  is  interesting  to  compare  this  spectra  with  the  hydrothermal  BSO  of  Fig,  1,  which  contains 
no  color  and  little  photoconductivity. 

Evidence  of  photoconductive  bleaching  could  be  observed  in  chrome  doped  hydrothermal  material. 
A  PC  spectra  for  sample  BS  60  is  shown  in  Fig.  4.  While  scanning  into  the  blue  and  UV  region  of  the 
spectrum,  the  photoconductivity  would  disappear.  Heating  the  sample  on  a  hot  plate  for  a  few  minutes 
would  restore  the  photoconductivity  to  the  sample.  Bleaching  rates  were  quite  high  for  this  sample,  and  a 
time  constant  could  not  be  measured  with  the  scanning  equipment  used. 

rV.  Refractive  Index  Measurements 

Refractive  index  measurements  were  made  on  various  samples  of  BSO  prepared  by  Czochralski 
and  Hydrothermal  transport.  Motivation  of  this  work  is  the  lack  of  color  in  the  hydrothermal  material. 
Smce  the  Cz  BSO  absorbs  with  a  500nm  absorption  edge  and  the  hydrothermal  does  not,  it  is  expected 
that  the  refractive  index  near  the  500nm  region  may  be  higher  in  the  Czochralski  than  in  the  Hydrothermal 
matenal.  Thin  films  of  yellow  BSO  on  transparent  hydrothermal  substrates  should  make  excellent 
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Table  II  Results  of  refractive  index  measurements  of  BSO.  Mrasurements  are  estimated  to  be 
within  ±  0.03 


Cz  BSO 

Hydrothermal  BSO 

Wavelength 

Sample  1 

Nut  for  BSO50 

Sample  2 

BSO  HOE 

Sample  1 

IlOh 

Sample  2 

BS  60A 

480  nm 

2.71 

2.67 

2.67 

2.66 

546  nm 

2.62 

2.59 

2.59 

2J9 

620  nm 

N/A 

2.53 

N/A 

N/A 

photorefractive  waveguides.  If  this  is  the  case,  then  the  hydrothermal  material  should  prove  to  be  excellent 
substrate  for  the  growth  of  thin  films  of  yellow  BSO  by  liquid-phase  epitaxy. 

Refractive  index  measurements  were  made  using  the  minimum  deviaticm  of  refracted  angle  of  a 
prism.*  Two  prism  samples  of  apex  angle  9°  were  made  in  each  of  Czochralski  and  hydrothermal  material. 
A  Hg/Cd  arc  lamp  was  used  as  the  source,  and  a  high-resolution  goniometer  was  used  to  detect  the 
refitted  angle.  Care  was  taken  to  align  the  prism  feces  and  several  measurements  were  made  of  apex 
angle  and  refracted  angle  for  each  sample. 

Results  of  these  measurements  are  shown  in  Table  II.  Sample  1  of  each  material  type  were  prisms 
made  by  the  author  at  Rome  while  sample2  of  each  were  made  by  a  commercial  supplier.  Because  the  red 
line  of  the  Hg/Cd  lamp  is  the  weakest  line,  measurements  at  620nm  could  not  always  be  made  for  all 
samples. 

Measured  refractive  indexes  are  centered  around  2.6,  which  is  the  value  determined  from 
absorption  at  infrared  wavelengths.  BSO  does  not  show  a  large  amount  of  energy  dependent  dispersion  at 
the  three  optical  energies  used  for  these  measurements.  Contrary  to  what  was  predicted,  not  much 
difference  is  observed  between  the  hydrothermal  and  Czochralski  refractive  index  for  the  480nm  line. 
However,  the  blue  line  at  480nm  and  the  green  line  at  546  nm  may  both  be  outside  of  the  resonance  line  of 
5  lOnm  as  determined  from  photoconductivity,  above.  Further  measurements  of  refiactive  index  with  a 
5  lOnm  source  are  planned  for  the  future. 
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V.  Implanted  BSO 

Samples  of  Cz  and  Hydrothermal  BSO  were  ion-implanted  for  two  purposes:  1)  As  an  attempt  to 
introduce  impurities  that  could  not  normally  be  introduced  by  melt  during  growth  or  by  diffusion.  The 
intent  was  again  to  alter  the  photoconductivity  in  such  a  way  as  to  increase  the  photorefracting  properties 
of  the  materials  at  longer  wavelengths.  2)  As  a  possible  way  to  produce  waveguides  on  BSO.  By 
implanting  a  deep  (at  least  lOOOA)  distance  into  the  sample,  the  intent  was  to  prepare  a  low  index  damaged 
region  with  a  range  as  deep  as  possible  into  the  sample,  which  could  be  used  to  form  a  waveguide  in  the 
higher  index  material  left  undamaged.  This  is  the  same  method  as  has  been  used  previously  to  prepare 
photorefracting  waveguides  in  BaTiOj  and  KNbOj. 

Implant  conditions  for  BSO  appear  in  Table  III.  Implanted  samples  were  sent  to  a  commercial 
laboratory  for  impurity  profile  analysis.  Impurity  profiles  as  determined  by  secondary  ion  mass 
spectroscopy  (SIMS)  is  shown  in  Fig.  5.  The  results  show  that  implanting  is  a  viable  method  for 
mtroducmg  impurities  in  BSO.  The  data  shows  an  implant  range  of  ~  lOOOA  for  Ge  implanted  into  Cz 

BSO  and  only  ~  lOOA  for  the  same  impurity  implanted  into  hydrothermal  material.  It  is  not  known  at  this 
early  stage  why  there  is  such  a  significant  difference  in  the  range. 

Before  and  after  photoconductivity  analysis  of  the  Cz  sample  demonstrated  that  the  implant  had 
only  a  slight  affect  on  the  photoconductivity  spectrum.  More  studies,  including  large  implant  doses 
followed  by  impurity  drive-in,  are  planned. 

VI.  Liquid  Phase  Epitaxy 

Initial  studies  of  liquid-phase  epitaxy  was  prepared  on  some  samples  by  growing  Bij^TiO^o  (BTO) 
on  BSO  substrates.  This  particular  system  was  selected  as  an  initial  study  of  epitaxy  as  the  melting 
temperature  of  BTO  (860T  )  are  less  than  that  for  BSO  (900*0.  LPE  growth  runs  were  carried  out  in  the 
same  growth  system  used  to  grow  Czochralski  BSO.  The  charge  was  melted,  supercooled,  then  a  BSO 


Table  III.  Implant  conditions  of  BSO 

Present  Vaue  Dose 

Set  Value  Dose 

Elapsed  Time 

Energy 

Beam  Current 


2X10'®  ions/cm^ 
2X10'®  ions/cm* 
1.75X10®  sec. 
150  kV 
43  |xA. 
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substrate  was  slowly  lowered  into  the  melt.  The  melt  was  supercooled  ~30°C.  The  seed  was  kept  in  the 
supercooled  melt  for  30minutes. 

Initial  runs  made  demonstrated  some  growth  of  material  on  the  BSO  substrates.  However  the 
quality  of  the  films  were  not,  as  yet  high  enough  for  characterization  studies.  Future  work  is  planned  on 
enhancing  the  quality  of  the  films  by  a  polishing  study  of  the  substrates,  and  a  more  ctetailed  refractive 
index  study  to  look  at  alternative  film  compositions. 
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CONDUCTED  INTERFERENCE  SIMULATION  RESULTS  FOR  A 
GENERAL  ELECTRIC  SOFT  PART  ANALOGOUS  MODULE  (SPAM) 


RickJ.  Levin 

Electromagnetic  Environmental  Effects  Laboratory 
Georgia  Tech  Research  Institute 

Abstract 


The  Soft  Part  Analogous  Module  (SPAM)  [1]  is  a  high  speed  1-megabit  RAM  that  was 
built  by  the  General  Electric  (GE)  Corporation  to  verify  multi-chip-module  (MCM)  fabrication 
techniques  and  processes  prior  to  full  construction  of  radiation  hardened  MCM’s.  Conducted 
Electromagnetic  Interference  (EMI)  measurements  were  performed  on  this  MCM  as  well  as  on 
the  individual  integrated  circuits  (IC’s)  within  the  MCM  [2].  The  individual  IC’s  were  measured 
in  order  to  determine  if  simpler  and  less  expensive  IC’s  could  be  evaluated  prior  to  testing  a 
more  complex  and  costly  MCM.  Conducted  EMI  simulations  for  two  of  the  MCM  interface 
IC’s  are  presented  in  this  report.  SPICE  circuit  analysis  was  used  to  simulate  conducted  EMI. 
The  utilization  of  Harmonic  Balance  circuit  analysis  to  simulate  conducted  EMI  is  discussed  in 
the  Appendix. 
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CONDUCTED  INTERFERENCE  SIMULATION  RESULTS  FOR  A 
GENERAL  ELECTRIC  SOFT  PART  ANALOGOUS  MODULE  (SPAM) 


Rick  J.  Levin 


1.0  INTRODUCTION 


The  Soft  Part  Analogous  Module  (SPAM)  [1]  is  a  high  speed  1-megabit  RAM  that  was 
built  by  the  General  Electric  (GE)  Corporation  to  verify  multi-chip-module  (MCM)  fabrication 
techniques  and  processes  prior  to  full  construction  of  radiation  hardened  MCM’s.  Conducted 
Electromagnetic  Interference  (EMI)  measurements  were  performed  on  this  MCM  as  well  as  on 
the  individual  integrated  circuits  (IC’s)  within  the  MCM  [2].  The  individual  IC’s  were  measured 
in  order  to  determine  if  simpler  and  less  expensive  IC’s  could  be  evaluated  prior  to  testing  a 
more  complex  and  costly  MCM.  Conducted  EMI  simulations  for  two  of  the  MCM  interface 
IC’s  are  presented  in  this  report.  SPICE  circuit  analysis  was  used  to  simulate  conducted  EMI, 
The  utilization  of  Harmonic  Balance  circuit  analysis  to  simulate  conducted  EMI  is  discussed  in 
the  Appendix. 

The  GE  SPAM  MCM  contains  the  dies  of  four  IC’s:  the  Texas  Instruments  74S00 
NAND  gate,  74S138  3-to-8  line  decoder,  74S373  D-flip-flop,  and  Performance  Semiconductor 
P4C164  8kx8  static  CMOS  RAM.  In  order  to  determine  if  circuit  simulations  of  conducted  EMI 
can  accurately  predict  measured  results,  the  Texas  Instruments  74S00  and  74S138  IC’s  were 
simulated  during  this  effort.  Conducted  EMI  was  simulated  with  a  continuous  wave  (CW)  single 
frequency  interference  waveform  over  the  interference  frequency  range  of  10  MHz  to  800  MHz. 
The  conducted  EMI  measurement  results  [2]  showed  that  the  74S00  two  inputs  are  equally 
susceptible  and  exhibited  no  unusual  responses  to  conducted  EMI.  The  six  74S138  inputs 
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exhibited  similar  susceptibility  levels  up  to  approximately  90  MHz,  while  the  two  enable  inputs 
exhibited  a  sharp  and  unusual  increase  in  the  signal  level  required  to  cause  susceptibility  at 
approximately  90  MHz.  The  conducted  EMI  waveforms  were  simulated  as  if  injected  on  the 
74S00  "Al"  input  and  74S138  "G2B"  (enable)  input. 

2.0  SPICE  Models 


Circuit  schematics  for  the  Texas  Instruments  IC’s  were  obtained  from  data  books  [3,4] 
and  from  the  manufacturer  [5].  The  TI 74S  series  of  IC’s  are  Schottky  technology,  which  makes 
extensive  use  of  Schottky  barrier  diodes  and  Schottky-clamped  transistors.  The  Schottky- 
clamped  transistor  is  a  transistor  with  a  Schottky  barrier  diode  attached  from  base  to  collector. 
Multiple  emitter  transistors  were  modeled  as  multiple  transistors  with  the  base  and  collectors  tied 
together. 

Detailed  transistor  and  diode  parameter  models  were  not  available  for  the  TI  74S  series 
of  IC’s,  which  is  an  old  technology  that  is  not  supported  by  TI.  The  74S  was  modeled  by  the 
following  representative  transistor  and  diodes:  the  2N3932  (NPN)  transistor,  the  1N914 
switching  diode,  and  the  1N5711  Schottky  barrier  diode.  The  default  parameter  of  1  was  used 
for  the  junction  areas  of  all  the  transistors  and  diodes  on  the  IC’s. 

The  74S00  contains  four  identical  independent  2-input  NAND  gates.  Each  NAND  gate 
contains  six  transistors  and  seven  diodes.  The  single  NAND  gate  illustrated  in  Figure  1  was 
used  in  the  simulations.  The  functional  (digital)  input  signals  were  represented  by  voltage 
sources  set  to  correspond  to  a  1  MHz  clock,  10  nsec  rise  and  fall  times,  0.4  Volts  logic  0  (low), 
2.4  Volts  logic  1  (high),  and  the  input  binary  patterns  presented  in  the  truth  tables  of  the  IC  data 
sheets  [3] .  The  IC  output  was  terminated  with  a  1  kOhm  resistor.  The  input  CW  interference 
waveform  was  simulated  with  a  CW  voltage  source.  A  divide  by  two  voltage  divider  was 
utilized  to  add  (combine)  the  interference  and  digital  input  signals.  To  compensate  for  the 
division  in  voltage,  the  interference  and  digital  source  amplitudes  (prior  to  the  voltage  divider) 
were  doubled. 
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Figure  1.  74S00  NAND  Gate  Schematic 


The  74S138  IC  is  a  3-line  to  8-line  decoder  that  contains  over  100  transistors.  The 
74S138  circuit  was  partitioned  into  3  input  blocks,  1  enable  block,  and  8  output  blocks,  as 
illustrated  in  Figure  2.  The  input  interference  waveforms,  input  digital  waveforms,  and  output 
terminations  were  simulated  in  the  same  manner  as  described  for  the  74S00. 
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Figure  2.  74S138  3-Lme  to  8-Line  Decoder  Schematic  (1/2) 
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Figure  2.  74S138  3-Line  to  8-Line  Decoder  Schematic  (2/2) 
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3.0  SPICE  Simulations 


The  SPICE  simulations  were  performed  with  PSPICE  [6]  software  on  a  33  MHz  486  PC 
with  8  Mbyte  of  RAM.  Results  of  the  74S00  simulations  were  reviewed  to  determine  the 
susceptibility  levels  at  the  different  interference  frequencies.  The  susceptibility  level  is  the  input 
interference  voltage  level  that  causes  a  false  logic  state  change  in  the  output  (as  compared  to  the 
IC  output  truth  table  [3]).  A  false  logic  state  occurs  when  the  output  voltage  of  a  logic  1  dips 
below  1.4  Volts  or  when  the  output  voltage  of  a  logic  0  rises  above  1.4  Volts  (which  is  the  same 
as  the  measurement  criteria).  Additionally,  the  output  waveform  was  sampled  for  false  logic  at 
mid-clock  period  (multiples  of  0.5  /^seconds)  this  corresponds  to  the  sampling  used  in  the 
measurements.  The  simulation  susceptibility  voltage  levels  were  converted  to  incident 
interference  power  levels  expressed  in  dBm  and  plotted  along  with  the  measured  susceptibilities 
of  the  74S00  as  illustrated  in  Figure  3.  For  interference  frequencies  less  than  approximately  300 
MHz,  the  simulated  and  measured  results  compare  well,  -t-3.5/-1.0  dB.  The  simulated 
susceptibilities  leveled-off  at  approximately  1 1  dBm  for  frequencies  greater  than  300  MHz,  while 
the  measured  susceptibility  levels  increased  with  frequencies  above  300  MHz.  The  poor 
comparison  at  high  frequencies  may  be  due:  1)  the  use  of  general  purpose  transistor  and  diode 
model  parameter,  2)  The  effects  of  the  IC  package/layout  (which  was  not  modeled). 

The  simulation  results  were  also  reviewed  to  determine  the  EMI  effect  as  the  input 
interference  voltage  level  was  increased  to  the  point  of  susceptibility.  The  dominant  interference 
effect  in  the  74S00  IC  simulations  is  that  the  output  logic  1  is  pulled  to  a  logic  0.  This  is  the 
same  effect  observed  in  the  measurement  results  [2].  The  74S00  output  voltage  waveforms  for 
10  MHz  input  interference  voltage  waveforms  with  different  input  peak-to-peak  voltage  levels 
ranging  from  1.0  to  1.6  Volts  are  shown  in  Figure  4.  The  74S00  output  voltage  waveforms  for 
60  MHz  input  interference  voltage  waveforms  with  different  input  peak-to-peak  voltage  levels 
ranging  from  1.5  to  2.5  Volts  are  shown  in  Figure  5. 
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Incident  Interference  Thresholds  for  74S00,  Pin  A1. 


Frequency  (MHz) 


Figure  3.  74S00  Conducted  EMI  Susceptibility 
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Figure  4.  74S00  Conducted  EMI  Effects  for  10  MHz 


A  1.4  Volt  output  logic  threshold  and  mid-clock  period  sampling  were  used  to  assess 
false  logic  states  for  the  74S138  simulations.  The  simulation  susceptibility  voltage  levels  were 
converted  to  incident  interference  power  levels  and  plotted  along  with  the  measured  susceptibility 
levels  of  the  74S138  as  illustrated  in  Figure  6.  The  simulation  results  did  not  show  the  sharp 
increase  in  the  input  power  level  required  to  cause  susceptibility,  at  approximately  90  MHz,  that 
was  observed  in  the  measurements.  The  2N3932  transistor  model  parameters  were  used  to 
represent  the  74S138  transistor  model  parameters,  since  the  74S138  transistor  model  parameters 
were  not  available.  A  few  of  the  2N3932  transistor  model  parameters  were  modified  as  follows: 


Ideal  Maximum  Forward  Beta 

Bf 

=  90 

Ideal  Maximum  Reverse  Beta 

B. 

=  2.5 

Zero-Bias  Base  Resistance 

Rb 

=  21 

Collector  Ohmic  Resistance 

Rc 

=  26 

Emitter  Ohmic  Resistance 

R. 

=  1.31 

Forward  Early  Voltage 

=  90 

The  74S138  simulations  were  repeated  with  the  modified  2N3932  transistor  model 
parameters.  For  interference  frequencies  less  than  approximately  100  MHz,  the  simulated  and 
measured  results  compare  well,  +!-  2.5  dB  and  the  sharp  increase  in  the  input  power  level 
required  to  cause  susceptibility,  at  approximately  90  MHz,  was  simulated.  The  simulated 
susceptibilities  started  to  level-off  at  approximately  100  MHz,  while  the  measured  susceptibility 
levels  increased  with  frequencies  above  100  MHz.  The  dominant  interference  effect  in  the 
74S138  IC  simulations  is  that  the  output  logic  0  is  pulled  to  a  logic  1.  This  is  the  same  effect 
observed  in  the  measurement  results  [2].  The  74S138  output  voltage  waveforms  for  10  MHz 
input  interference  voltage  waveforms  with  different  input  peak-to-peak  voltage  levels  ranging 
from  1.0  to  1.5  Volts  are  shown  in  Figure  7.  The  74S138  output  voltage  waveforms  for  60 
MHz  input  interference  voltage  waveforms  with  different  input  peak-to-peak  voltage  levels 
ranging  from  1.5  to  2.0  Volts  are  shown  in  Figure  8. 
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Incident  Interference  Thresholds  for  74S138AN,  Pin  G2B. 


Frequency  (MHz) 


Figure  6.  74S138  Conducted  EMI  Susceptibility 
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Figure  8.  74S138  Conducted  EMI  Effects  for  60  MHz. 


4.0  Conclusions  and  Recommendations 


The  conducted  EMI  effects  and  susceptibility  levels  of  the  74S00  and  74S138  IC’s  within 

the  MCM  can  be  simulated  with  SPICE.  The  SPICE  simulations  were  accurate  within  +3.5/- 

2.5  dB  for  frequencies  less  than  100  MHz.  The  correct  transistor  and  diode  model  parameters 

may  increase  the  frequency  range  at  which  the  simulations  provide  accurate  results. 

It  is  recommended  that  this  research  effort  be  extended  to  include  the  following: 

1)  A  comparison  of  conducted  EMI  susceptibility  simulations  to  measurements  for  various 
technologies  (e.g.,  AS,  LS,  ALS,  and  CMOS)  of  the  74  series  of  IC’s. 

2)  A  comparison  of  conducted  EMI  susceptibility  simulations  to  measurements  for  IC’s  at 
different  temperatures. 

3)  An  investigation  of  IC  reliability,  through  a  comparison  of  internal  IC  power/voltage 
levels  from  parametric  simulations  with  respect  to  input  conducted  EMI  levels  and  IC 
temperature. 
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Appendix.  Comments  on  Harmonic  Balance  Simulations. 

Early  in  this  effort,  Harmonic  Balance  (HB)  circuit  analysis  techniques  were  also 
considered  for  IC  conducted  EMI  simulations.  SPICE  and  HB  simulations  were  conducted  on 
an  example  circuit  and  compared  to  measurements.  Libra  [7]  HB  software  and  PSPICE  [6] 
SPICE  software  were  used  in  the  simulations.  A  National  Semiconductor  DS8820/7820 
differential  line  receiver  IC  was  used  in  example  simulations,  since  transistor  model  parameters, 
schematics,  and  measurements  were  available  from  a  previous  effort  [8].  The  DS8820/7820  IC 
contains  ten  2N3932  transistors  and  no  diodes. 

The  differential  line  receiver  compares  two  input  signals  and  outputs  a  digital  signal 
based  on  the  comparison.  One  of  the  two  inputs  was  connected  to  ground  for  the  simulations 
and  a  1  MHz  periodic  pulse  waveform  and  a  CW  interference  waveform  were  both  connected 
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to  the  other  input.  The  PSPICE  simulations  converged  and  were  similar  to  the  measurements 
performed  on  the  previous  effort.  The  Libra  simulations  would  not  converge  with  the 
interference  source  connected.  In  order  to  have  the  simulation  converge,  the  interference  source 
was  removed  from  the  circuit  and  the  libra  parameters  controlling  convergence  and  accuracy 
were  varied.  After  some  doing  the  HB  simulations  (without  interference)  converged,  but  were 
not  similar  to  the  measurements  (without  interference)  performed  on  the  previous  effort. 

HB  circuit  analysis  [9]  is  described  in  order  to  obtain  insight  into  the  HB  convergence 
and  accuracy  during  IC  simulations.  HB  is  a  mixed  domain  circuit  analysis  technique  that 
segments  the  circuit  into  linear  and  nonlinear  elements.  The  linear  elements  form  a  multi-ported 
network  described  by  S-  or  Y-parameter  matrices.  The  nonlinear  elements  are  modeled  by  their 
large-signal  I/V  or  Q/V  characteristics.  The  linear  elements  are  analyzed  in  the  frequency 
domain  and  the  nonlinear  elements  are  analyzed  in  the  time  domain.  The  HB  technique  satisfies 
Kirchhoffs  current  law  at  the  port  nodes  separating  the  linear  and  nonlinear  subcircuits.  First, 
an  initial  guess  is  made  of  the  time  voltage  waveforms  at  each  of  the  port  nodes.  The  time 
voltage  waveforms  are  then  Fourier  transformed  into  the  frequency  domain  components 
corresponding  to  the  harmonics  of  the  excitation  frequency  (including  the  DC  term  and 
fundamental).  The  linear  sub-circuit  is  then  solved  in  the  frequency  domain  for  the  port  node 
currents  of  each  harmonic.  The  nonlinear  subcircuit  is  analyzed  using  the  initial  time  voltage 
waveforms.  The  resulting  time  current  waveform  is  then  Fourier  transformed  into  the  frequency 
domain  components  corresponding  to  the  harmonics  of  the  excitation  frequency  (including  the 
DC  term  and  fundamental).  At  each  port  node  and  for  each  harmonic  of  the  excitation 
frequency  (including  the  DC  and  fundamental)  the  addition  of  the  linear  and  nonlinear  current 
waveforms  will  equal  0  if  the  initial  voltage  waveforms  were  a  good  guess.  If  the  addition  of 
the  linear  and  nonlinear  currents  are  not  within  the  error  tolerance,  then  the  process  is  repeated 
with  new  time  voltage  waveforms.  Newtons-method  is  a  popular  algorithm  for  solving  the  time 
waveforms  in  successive  iterc  ns  due  to  its  good  convergence  properties  and  efficiency. 

The  DS8820/7820  differential  line  driver,  which  contains  ten  transistors,  had  HB 
convergence  and  accuracy  problems.  A  combination  of  factors  contribute  to  these  problems,  the 
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number  of  nonlinear  elements,  the  number  of  harmonics  required  to  adequately  represent  the 
input  waveforms,  and  the  number  of  harmonics  required  to  adequately  represent  the  nonlinearity 
of  the  circuit  elements.  These  problems  are  further  exacerbated  when  the  fundamental  excitation 
frequencies  used  to  represent  the  functional  and/or  interference  waveforms  are  not  harmonically 
related.  Multiple  excitation  frequencies  will  increase  the  number  of  frequency  components  that 
need  to  be  balanced  at  each  port  node  since  each  excitation  frequency  has  different  harmonics 
and  the  nonlinear  elements  will  produce  mixing  products  of  the  different  excitation  frequencies. 
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GHz  MICROWAVE  RADIATION  BY  PICOSECONDS  OPTICAL  PULSES 


by 

David  Liu 


ABSTRACT 

High  power,  directional  l-20GHz  microwave  radiation  has  been  generated  out  of  high 
resistivity  by  50  picoseconds  optical  pulses.  The  radiation  signal  generated  as  well  as  the 
associated  mechanisms  were  monitored  directly  on  the  sampling  oscilloscope  in  real  time.  The 
phase,  amplitude,  and  direction  of  the  radiation  can  be  effectively  manoeuvered  by  optical 
method.  The  combination  of  optics  and  microwave  of  this  innovative  technique  provides  the 
potential  of  flexibility,  simplicity  and  low  cost  in  microwave  applications. 
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INTRODUCTION  &  TECHNICAL  BACKGROUND 

The  advent  of  picoseconds  laser  pulses  has  made  it  possible  to  generate  GHz  microwave 
signals  via  the  use  of  optical  switching  methods  [1].  Photoconducting  switches  consist  of 
normally  insulating  doped-semiconductor  material  between  two  electrodes.  Illumination  by  a 
laser  pulse  free  some  electrons  bound  to  the  atoms  of  the  material  and  enable  them  to  conduct 
electrical  current.  Fast  response  time  of  the  order  of  picoseconds  has  been  achieved  out  of  this 
principle.  Thus,  microwave  signals  can  be  normally  generated  by  photoconductive  shorting  the 
charged  coplanar  microstrip  transmission  lines  with  short  laser  pulses.  The  electrostatic  energy 
stored  in  the  transmission  lines  is  thus  converted  into  the  energy  of  microwave  radiation. 

The  results  of  this  technique  give  the  frequency  of  millimeter-wave  burst  is 
approximately  55GHz  (corresponding  to  a  period  of  18ps)  by  the  measurements  of  Auston  and 
coworkers  [2].  Based  upon  this  principle,  a  hybrid  device,  so  called  the  frozen  wave  generator 
[3]  for  the  radiation  signal  in  the  5GHz  frequency  regime  was  developed.  A  schematic  of  a 
basic  five-cycle  frozen  wave  generator  is  illustrated  in  Fig.  1.  Initially,  as  all  the  six  switches 
are  open,  all  the  six  elements  of  transmission  line  are  charged  to  potentials  ±Vo  as  shown.  This 
static  condition  gives  the  "frozen  wave".  When  all  the  switches  are  simultaneously  closed,  the 
frozen  wave  is  free  to  propagate.  If  the  switches  are  closed  not  simultaneously,  but  in  a  certain 
synchronous  manner,  a  different  output  wave  form  will  be  produced.  The  adequate  timing  for 
closing  the  switches  can  be  most  effectively  achieved  by  the  optical  path  lengths  adjustment 
in  order  to  provide  the  desired  modulated  output  signals.  Lately,  Thaxter  [4]  at  Rome 
Laboratory,  Hanscom  AFB  has  demonstrated  the  success  of  frozen  wave  generator  in  the  5GHz 
regime  by  incorporating  with  single  mode  optical  fibers. 

The  energy  source  of  radiation  previously  discussed  has  been  contributed  by  the  electro¬ 
static  energy  stored  in  the  microstrip  transmission  lines.  Recently,  Hu  and  coworkers  [5] 
pioneered  a  new  approach  by  utilizing  photoconducting  material  itself  as  the  energy  source  for 
emitting  microwave  radiation.  By  illuminating  with  colliding  pulse,  mode-locked  (CPM)  75fs 
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short  pulses,  large  planar  photoconductors  were  found  to  radiate  a  directional  electromagnetic 
pulse  in  space,  which  can  be  steered  by  varying  the  angle  of  incidence  of  the  optical  beam.  The 
short  photocurrent  generated  in  a  voltage  biased  photoconductor  by  the  optical  pulse  serves  as 
the  radiation  source.  The  photocurrent  signal  is  a  short  pulse,  whose  frequency  spectrum 
extends  from  dc  to  ITHz.  It  has  a  median  frequency  of  ~600GHz,  corresponding  to  a  free-space 
wavelength  of  500//m.  Zhang  and  Auston  [6]  went  further  by  optically  inducing  THz 
electromagnetic  beams  steered  by  controlling  the  pattern  structure  with  two  degrees  of 
freedom.  A  semi-insulating  InP  wafer  illuminated  by  the  unfocused  optical  beam  was  used  as 
the  emitter.  The  temporal  measurement  was  achieved  using  the  technique  of  optical  sampling 
by  varying  the  time  delay  between  the  excited  laser  pulse  illuminated  on  the  semiconductor 
surface  and  the  trigger  laser  pulse  focused  on  the  photoconducting  detector. 

By  adopting  the  similar  idea,  this  summer  we  utilized  high  power  (peak  power  >  50KW). 
mode-locked,  Q-switched  50ps  laser  pulses,  instead  the  75  fs  CPM  pulses  by  Zhang  and  Auston 
mentioned  earlier,  to  generate  microwave  pulse  in  the  lOGHz  regime.  The  uniqueness  and  main 
objectives  of  this  work  are  outlined  as  follows: 

(1)  The  frequency  range  of  radiation  we  try  to  generate  is  of  current  use  in  radar 
communication.  1-20  GHz  frequency  range  is  most  active  in  microwave  communication 
technology  today. 

(2)  The  high  power  laser  pulse  and  high  quality  semi-insulating  InP  materials  available 
at  Rome  Laboratory,  Hanscom  AFB  are  ideal  for  us  to  conduct  this  research. 

(3)  Taking  advantage  of  the  20GHz  resolution  of  the  our  sampling  oscilloscope,  we  are 
able  to  observe  the  microwave  radiation  profile  directly  on  the  scope  screen.  This  allows 
us  to  analyze  the  data  in  a  more  effective  way  than  the  similar  work  previously 
reported. 

(4)  Again,  the  equipment  in  hand  allows  us  to  seek  the  answers  as  to  the  roles  of 
conducting  current  and  displacement  current  in  microwave  generation  under  this 
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scheme. 


(5)  Incorporating  the  attractive  features  of  both  frozen  wave  generator  and  large- 
aperture  photoconducting  antenna,  the  experiment  setup  provides  flexibility  and  higher 
degree  of  freedom  for  generating  directional,  high  powered,  modulated  microwave 
signals. 


EXPERIMENTAL 

The  schematic  diagram  for  the  experimental  setup  is  illustrated  in  Fig.  2,  3.  The  mode 
locked,  Q-switched,  frequency-doubled  YLF  laser  system  provides  an  output  of  4pJ  pulse 
energy,  lOOKW  peak  power,  50ps  pulse  duration  with  a  center  wavelength  at  527nm.  The  laser 
pulses  were  selectively  chosen  by  a  Pockel  cell  with  a  repetitive  rate  of  400Hz.  The  laser  beam 
was  split  into  two  after  the  frequency  doubler  with  approximately  a  1:10  ratio.  The  weak  beam 
was  fed  into  an  ultrafast  photodetector  for  triggering  the  TEK11802  sampling  scope.  The  main 
beam  was  positioned  to  pass  through  an  optical  path  delay  line  of  about  15m  (equivalent  to  a 
50ns  time  delay)  before  reaching  the  sample  in  order  to  bypassing  the  delay  line  on  the  scope 
for  the  best  detecting  temporal  resolution  of  20GHz.  For  investigating  the  nonlinear  and 
interference  effect,  the  main  laser  beam  was  divided  equally  into  two  and  then  be  guided  by 
two  optical  fibers.  The  electric  pulses  with  an  amplitude  of  200V  and  width  of  5ps  were 
amplified  by  a  TV  transformer  to  come  up  a  lOKV  bias  voltage  across  the  sample.  Standard  S- 
band,  X-band,  K-band  and  self-made  wide  band  wave  guide  transducers  were  used  to  measure 
the  microwave  radiation  in  real  time.  Microwave  spectral  analyzer  was  taken  as  the 
complementary  tool  for  determining  the  spectral  distribution  for  the  radiation  signal  received. 
High  quality  Fe  doped  InP  and  Fe,  Zn  doped  InP  grown  under  stabilization  magnetic  field  at 
Rome  Laboratory  were  preferentially  taken  as  the  samples  (microwave  emitters)  for  this 
experiment. 
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RESULTS  AND  DISCUSSION 


The  radiated  electric  field  due  to  the  localized  photoconducting  current  can  be 
expressed  as  [7] 


£=-£  Ti  + 

'’cose..  *  " 


1 


cos6j  COS0J 


(1) 


E(,  is  the  bias  field,  is  the  characteristic  impedance  of  free  space,  6;  is  the  angle  of  incidence 
of  the  optical  beam,  0j  is  the  angle  of  the  inward  propagating  electric  pulse,  e  is  the  dielectric 
constant  of  the  photoconductor,  and  is  the  surface  photoconductivity,  given  by 


(»-<•) 


eu(l-R)cos0,  , 
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where  p  is  the  carrier  mobility  of  the  photoconductor,  R  is  the  optical  reflectivity,  htl)  is  the 
photon  energy,  I^p  is  the  optical  intensity,  and  is  the  photocarrier  decay  time.  From  equation 
(1)  we  see  that  at  low  optical  excitation,  the  electric  field  is  proportional  to  the  optical 
intensity. 

For  the  purpose  of  avoiding  signal  contamination  caused  by  the  attached  conducting 
wires  and  establishing  more  localized  current  source  on  the  sample,  the  electrical  contacts  to 
the  photoconductor  are  made  as  not  ohmic. 

Typical  signal  received  by  the  wide  band  detector  under  the  low  optical  excitation  over 
the  entire  sample  is  shown  in  Fig.  4.  Like  a  replica  of  the  optical  laser  pulse,  the  microwave 
pulse  with  a  temporal  width  of  ~50ps  was  observed.  The  broad  band  nature  of  the  radiation 
microwave  pulse  can  be  verified  by  using  narrow  band  wave  guide  detectors.  As  shown  in  Fig 
5-7,  signals  were  received  by  S-band,  X-band,  K-band  detector  respectively.  The  respective  peak 
frequency  revealed  the  maximum  efficiency  of  the  combination  of  the  wave  guide  spectral 
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efficiency  and  the  signal  spectral  distribution.  The  signals  were  further  investigated  by  using 
microwave  analyzer,  the  peak  frequency  of  about  12GHz  was  observed  (not  shown),  of  which 
the  convolution  is  consistent  with  the  pulse  temporal  width  of  50ps. 

As  the  illuminating  beam  was  more  focused  on  the  spot  near  electrodes,  a  stronger 
radiation  occurred.  As  predicted  by  Auston  [8],  a  strong  electric  field  near  the  electrodes  will 
remain,  which  can  have  a  significant  influence  on  the  transport  properties  of  electrons  and 
holes  in  the  sample.  The  static  approximation  for  E  in  this  case  can  be  determined  by  conformal 
mapping  and  theoretically  can  be  expressed  as 


£(x,z=0)=— 

Tti 


l-(jf 


(3) 


where  Vg  is  the  bias  voltage  and  (  is  the  length  between  the  electrodes.  It  is  a  minimum  at 
center  of  the  gap,  and  increases  to  infinity  at  the  metal  electrodes.  The  experimental  technique 
provides  a  powerful  and  convenient  tool  for  mapping  the  static  electric  field  distribution  on 
the  semiconductor  surface. 

As  the  illuminating  beam  was  better  focused  on  the  sample,  the  excitation  intensity  was 
enhanced.  The  radiation  amplitude  was  reduced  by  attenuating  the  laser  beam  intensity. 
However,  the  variation  was  not  in  a  linear  fashion,  saturation  and  nonlinear  behavior  occurred, 
as  shown  in  Fig.  7&8.  This  might  suggest  that  the  electric  field  on  the  sample  has  been  depleted 
at  large  as  the  laser  intensity  is  above  certain  value  -  that  was  approximately  5MW/ cm^  for  our 
measurement.  As  mentioned  earlier,  the  radiation  energy  is  due  to  the  conversion  of  the  static 
energy  (eE^/2)  in  the  sample.  The  sample  in  this  sense  can  be  considered  as  a  combination  of 
a  capacitor  and  a  resistor  with  varying  resistance.  The  depletion  of  the  electric  field  will 
meanwhile  keep  a  low  intrinsic  resistance  for  a  longer  time  duration.  As  a  consequence,  the 
signals  will  become  more  oscillatory  (less  damped).  The  observed  larger  "overshoot"  signal  under 
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higher  optical  illumination  (Fig.  7)  supports  the  qualitative  description  stated  above.  In 
addition,  the  short  time  constant  of  RC  due  to  reduced  R  induced  an  unsymmetric  microwave 
signal  profile,  of  which  the  temporal  width  was  even  less  than  that  of  incident  optical  pulse. 

The  microwave  radiation  signal  can  be  effectively  modulated  by  optical  means,  which 
offers  simplicity  and  low  cost.  Such  a  feature  was  demonstrated  by  using  two  optical  sources 
guided  by  single  mode  fibers.  Fig.  9  shows  the  radiation  signal  under  single  optical  illumination 
(Fig.  9,  (a)&(c))  and  dual  optical  illumination  (Fig.  9,  (b)).  Adjustment  of  the  optical  path  length 
difference  corresponds  to  the  phase  shift  of  the  microwave  signal.  The  superposition  nature  of 
output  microwave  radiation  under  dual  illumination  and  the  geometrical  pattern  illuminated 
on  the  InP  sample  by  the  optical  signals  provides  the  degrees  of  freedom  to  steer  as  well  as 
modulate  the  radiation  output. 
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SUMMARY 


•  l~20GHz  microwave  radiation  was  successfully  generated  by  picoseconds  optical  pulses.  The 
output  signal  as  well  as  the  associated  mechanisms  were  monitored  for  the  first  time  by 
20GHz  sampling  oscilloscope  in  real  time. 

•  The  energy  source  of  radiation  was  contributed  by  the  static  electric  energy  not  from  the 
transmission  lines,  but  from  the  sample.  Nonlinear  effect  due  to  electric  field  depletion  in 
the  sample  was  observed. 

•  High  quality  Fe  doped  InP  and  Fe,  Zn  doped  InP  grown  under  stabilization  magnetic  field 
at  Rome  Laboratory  were  preferentially  chosen  as  the  radiation  source  due  to  material 
homogeneity  and  high  dark  resistivity  (  >10®Qcm). 

•  The  direction,  phase  and  amplitude  profile  of  radiation  output  were  effectively  controlled 

by  optical  means. 

•  The  simplicity,  flexibility  and  the  low  cost  offered  by  this  technique  merit  new  application 
in  radar  communication  as  well  as  electronic  testing  device,  such  as,  generation  of  microwave 
radiation  in  space,  electronic  field  mapping  on  the  semiconductor  wafer. .etc. 
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Fig.  1  Frozen  wave  generator  schematic. 
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Fig.  2  Schematic  diagram  for  the  experimental  setup. 
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Fig.  3  Wideband  GHz  pulse  radiation  out  of  electric  pulsed  bias  InP:Fe  wafer. 
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Fig.  4  A  50  picosecond  microwave  pulse  generated  by  50  picosecond  laser  pulse. 
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Fig.  5  Microwave  output  signal  received  by  S-band  ridge  waveguide  transducer. 
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Fig.  6  Microwave  output  signal  received  by  K-band  ridge  waveguide  transducer. 
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Fig.  7  Nonlinear  effect  due  to  the  depletion  of  the  electric  field  in  InP  wafer  under 
intensive  illumination. 
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Fig.  8  Microwave  radiation  intensity  vs.  optical  illumination  intensity.  Nonlinear  behavior  was 
observed.  The  solid  line  is  a  guide  to  the  eyes. 
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Fig.  9  Microwave  signals  generated  by  single  illumination  ((a)&(c))  and  dual  illumination  (b). 
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Abstract 

For  one-step,  in-situ  synthesis  of  phosphorus  vapor  and  indium  melt,  and  liquid- 
encapsulated  Czochralski  growth  of  InP  crystals  to  succeed  and  produce  single  crystals  of 
uniform  quality  and  at  lesser  cost,  it  is  important  to  develop  a  basic  understanding  of  the 
mechanics  of  heat  transfer  and  gas  flow  in  a  high  pressure  crystal  growth  (HPCG)  furnace.  A 
series  of  experiments  performed  recently  in  order  to  characterize  the  temperature  conditions  at 
the  graphite  susceptor  and  cmcible  walls  is  reported  here.  These  results  demonstrate  the  vari¬ 
ations  in  temperature  prof;.cs  w-‘  the  power  input  as  well  as  with  the  crucible's  vertical 
location.  A  computer  model  to  ^dy  the  gas  flow  in  an  HPCG  furnace  has  also  been 
developed.  Numerical  computations  are  performed  u*  ;.:;amine  the  gas  flow  and  temperature 
fields  with  and  without  the  presence  of  phosphorus  iiijcctor  inside  the  furnace.  Complex  recir- 
culatory  flows  which  become  oscillatory  at  high  Grashof  numbers,  are  produced  by  the  buoy¬ 
ancy  forces.  The  temperature  variations  are  generally  very  strong  in  the  vicinity  of  the  encap- 
sulant  surface,  and  weak  in  the  outer  region  which  agree  qualitatively  with  the  temperature 
measurements. 
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AN  EXPERIMENTAL  AND  NUMERICAL  STUDY  OF  THERMAL  CONDITIONS 
IN  A  HIGH  PRESSURE  SYSTEM  FOR  INDIUM  PHOSPHIDE  CRYSTAL  GROWTH 

V.  Prasad  and  A.  P.  Anselmo 


INTRODUCTION 

In  order  to  grow  single  crystals  of  III-V  compounds  such  as  gallium  arsenide,  indium 
phosphide  and  so  on,  a  high  pressure  crystal  growth  (HPCG)  system  is  used.  The  mechanisms 
of  energy  transport  in  such  furnaces  are  very  different  from  that  in  the  low  pressure 
Czochralski  puller  for  silicon  crystals.  While  in  a  low  pressure  system,  the  energy  is  trans¬ 
ferred  to  the  melt  and  from  the  melt  and  crystal,  primarily  by  conduction  and  radiation,  gas 
convection  plays  an  important  role  together  with  other  modes  of  heat  transfer  in  determining 
the  thermal  field  within  an  HPCG  puller.  The  buoyancy-driven  convective  flows  in  a  high 
pressure  chamber  are  turbulent  and  oscillatory,  and  the  recirculating  gas  flow  pattern  is  a 
strong  function  of  the  geometric  configuration,  temperatures  of  the  r.f.  coil,  susceptor,  melt, 
crystal  and  puller  walls,  and  the  gas  properties.  In  addition,  the  convective  heat  transfer  in  an 
HPCG  system  is  strongly  coupled  with  the  conduction  in  various  components  of  this  system 
and  radiation  exchange  between  the  surfaces  which  can  see  each  other. 

The  heat  transfer  mechanism  is  further  complicated  by  the  process  employed  to  grow 
the  single  crystal.  Motivated  by  the  high  cost  of  indium  phosphide  crystals  grown  by  a  two- 
step  method  -  the  synthesis  of  polycrystal  by  the  horizontal  Bridgman  method  and  single 
crystal  growth  by  the  liquid-encapsulated  Czochralski  (LEC)  method,  a  one-step  operation  has 
been  proposed  and  investigated  by  several  researchers  [1-11].  It  is  basically  an  LEC  method 
for  pulling  a  single  crystal  from  an  "in-situ"  synthesized  melt  of  indium  phosphide.  Recently, 
magnetically  stabilized  liquid-encapsulated  Czochralski  (MLEC)  and  Kyropoulos  (MLEK) 
growth  of  InP  single  crystals  have  also  been  reported  [12,13]. 

For  synthesis,  the  heat  radiated  by  the  melt  is  absorbed  by  the  solid  phosphorus  for 
melting  and/or  sublimation.  The  convective  heat  transfer  also  helps.  The  temperature  of  the 
phosphorus  can  therefore  be  controlled  by  changing  the  vertical  location  of  the  injector.  The 
ampoule  is  placed  directly  above  the  melt  and  the  vapor  is  transferred  to  indium  melt  by  a 
quartz  tube  whose  length  is  another  key  variable  in  the  "in  situ"  synthesis.  The  heat  transfer 
phenomena  during  the  in-situ  synthesis  is  therefore  highly  transient.  The  movement  of  P- 
injector  changes  the  temperature  field  significantly  by  modifying  the  radiation  exchange 
between  various  surfaces  and  by  changing  the  convective  gas  flow  rate  and  its  structure.  The 
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energy  generated  into  the  melt  due  to  the  reaction  of  In  and  P,  and  the  volume  increase  (p,„  = 
7.31  and  p,„p  =4.787)  also  affect  the  heat  transfer  and  gas  velocity. 

For  one-step,  in-situ  synthesis  and  growth  process  to  succeed  and  produce  InP  crystals 
of  uniform  quality  and  at  lesser  costs,  it  is  important  to  understand  the  mechanics  of  heat 
transfer  and  gas  flow  in  an  HPCG  furnace.  This  can  be  achieved  only  by  a  well-coordinated 
experimental  and  numerical  research  program.  During  this  summer,  we  developed  a  model  to 
analyze  the  gas  flow  inside  the  HPCG  furnace,  and  have  performed  simulations  for  a  number 
of  selected  cases.  The  computed  flow  fields  support  the  observed  flow  phenomena  in  the 
furnace  [14,15].  Also,  whenever  a  model  is  used  to  simulate  the  crystal  growth  process,  a 
selection  of  suitable  boundary  conditions  poses  the  biggest  problem.  Special  experiments  were 
therefore  designed  and  conducted  to  determine  the  boundary  conditions  on  the  crucible  wall  as 
well  as  the  relationship  between  the  power  input  and  the  wall  temperature  variations  as 
functions  of  the  crucible's  location. 

DESCRIPTION  OF  THE  HPCG  FURNACE 

The  HPCG  furnace  (Fig.  1)  used  for  ongoing  research  at  Rome  Laboratory  uses  a  96 
mm  i.d.  quartz  crucible  or  a  PBN  cup  of  2  mm  thick  wall,  placed  in  a  113  mm  o.d.  graphite 
susceptor.  The  susceptor  is  insulated  from  the  r.f.  coil  by  "Fiberfrax  Lo-Con  Felt"  insulation 
placed  in  an  annular  space  formed  by  two  quartz  cylinders.  An  opaque  glass  plate  of  280  mm 
diameter  separates  the  bottom  portion  of  the  furnace  from  its  upper  region.  However,  the  gas 
can  flow  downward  through  a  2  mm  gap  between  this  plate  and  cold  puller  wall.  The  plate  is 
placed  on  top  of  the  quartz  cylinders,  305  mm  from  the  bottom  surface  of  the  puller,  while  the 
mean  height  of  the  upper  tube  of  the  r.f.  coil  is  286  mm.  The  graphite  susceptor  with  crucible 
can  be  moved  up  or  down,  and  can  be  rotated  by  a  shaft.  The  shaft  holds  a  boron  nitride  (BN) 
hollow  cylinder  which,  in  turn,  supports  the  susceptor.  A  tungsten  sheathed  W26Re-W.5Re 
thermocouple  probe  passes  through  the  shaft  and  measures  temperature  at  the  crucible  bottom. 

A  75  mm  o.d.,  1(X)  mm  long  quartz  ampoule  is  used  for  phosphorus  storage  and  its 
vapor  injection  into  the  indium  melt.  This  injector  has  an  off-center  tube  of  12  mm  i.d.  for  the 
vapor  outflow.  The  injector  can  be  hung  from  the  top  with  the  help  of  a  12  mm  i.d.  hollow 
tube  through  which  a  cooled  seed  shaft  can  pass.  The  injector  can  be  moved  up  or  down 
independent  of  the  seed  shaft.  The  gap  (about  2  mm)  between  the  hollow,  support  tube  and 
the  seed  shaft  can  allow  the  cold  gases  to  flow  downward  along  the  seed  shaft.  To  insulate  the 
phosphorus  injector  from  heat  losses,  8  mm  Lo-Con  felt  insulation  is  placed  between  the 
injector  and  an  inverted  quartz  cup  (Fig.  1). 
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A  quartz  cylinder  of  103  mm  i.d.  is  used  as  a  heat  shield.  However,  the  heat  shield 
cannot  completely  eliminate  the  buoyancy-induced  flows  -  gas  flows  downward  along  the  seed 
shaft  and  upward  along  the  quartz  shield  -  it  can  reduce  the  gas  velocity  significantly.  The  heat 
shield  rests  on  the  graphite  susceptor  and  therefore  can  rotate  with  the  crucible.  Further  details 
of  the  HPCG  furnace  may  >ound  sewhere  [14,15]. 

There  are  two  viewing  por  in  this  HPCG  puller,  one  is  used  for  video  monitoring  and 
photography  while  the  other  helps  in  visual  obs  ation.  Since  the  system  is  run  at  a  high 
temperature  (above  1063 °C)  and  a  high  pressui  1 500-600  psi)  at  which  the  density  of  nitrogen 
is  about  six  to  fifteen  times  larger  than  at  the  atmospheric  condition,  it  is  possible  to  see  the 
direction  of  flow  whenever  the  gas  flows  over  a  step.  Light  reflections  also  seem  to  help  in 
this  visualization.  Although  only  a  small  portion  of  the  growth  chamber  is  visible,  it  is 
possible  to  draw  flow  patterns  qualitatively  from  the  observed  gas  flow  behavior  [14,15]. 

EXPERIMENTAL  WORK 

A  series  of  experiments  to  study  the  thermal  conditions  in  this  HPCG  furnace  was 
conducted  earlier  and  has  already  been  reported  in  the  literature  [15].  The  purpose  of  the 
present  experiment  is  to  measure  temperature  variations  on  the  crucible  wall  and  the  outer 
surface  of  the  susceptor,  and  to  determine  their  relationship  with  the  power  input  as  well  as 
with  the  crucible's  location.  Several  thermocouples  were  therefore  placed  outside  the  susceptor 
and  inside  the  crucible  as  shown  in  Fig.  2.  This  was  the  first  time  that  11  thermocouples  were 
used  to  record  temperature  distributions  in  this  system.  A  conex  fitting  was  used  to  carry  eight 
thermocouple  extension  wires.  However,  this  was  not  a  simple  job,  because  placing  the 
thermocouples  at  right  locations  and  holding  them  there  during  the  experiments  are  not  easy 
tasks.  Inner  thermocouples  were  placed  inside  thin  quartz  tubes  attached  to  the  crucible  wall, 
to  prevent  them  from  damage  from  the  melt  as  well  as  to  hold  them  in  place.  In  order  not  to 
alter  the  susceptor's  characteristics,  outer  thermocouples  were  placed  on  the  outside  of  the 
inner  quartz  cylinder.  We  also  connected  a  personal  computer  to  the  existing  Omega  strip  chart 
recorder  to  monitor  and  record  temperature  continuously.  Software  were  developed  to  collect 
the  temperature  information  and  store  them  for  further  analysis.  We  were  therefore  able  to  add 
some  permanent  features  to  the  existing  system  which  can  help  in  future  experiments. 

First,  the  crucible  was  held  in  its  ce  nral  location  (@  ?  cm)  and  temperatures  were 
recorded  for  three  different  power  inputs.  Next,  the  power  was  kept  fixed  and  the  crucible  was 
moved  up  or  down  to  study  the  effect  of  its  location.  Figure  3  presents  the  crucible's  location 
with  respect  to  the  r.  f.  coil.  Sufficient  time  was  allowed  after  every  change  in  either  power  or 
location,  for  the  system  to  reach  (quasi)  steady-state.  From  the  temperature  data  collected,  it  is 
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jple  locations  for  temperature  measurements  on  the  inside  of  the 
id  outside  of  the  graphite  susceptor. 
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possible  to  study  the  transients  inside  the  HPCG  furnace.  However,  this  report  will  present 
temperatures  only  under  steady  conditions  (Figs.  4  and  5). 

We  also  designed  and  conducted  two  other  sets  of  experiments.  The  objective  of  the 
first  experiment  was  to  study  the  effects  of  a  graphite  cover  placed  on  top  of  the  phosphorus 
injector.  This  was  observed  to  suppress  the  temperature  fluctuations  [14,15]  on  the  injector  and 
temperature  rise  at  the  bottom  and  top  of  the  injector  was  very  smooth.  However,  the  rate  of 
phosphorus  vaporization  was  slow  which  did  not  help  the  synthesis  process.  The  second  set  of 
experiments  was  conducted  to  examine  the  effect  of  a  reduction  in  radiation  loss  from  the  melt. 
The  space  between  the  furnace  wall  and  the  heat  shield  (146  mm  dia.  in  this  case)  was  filled 
with  an  insulation,  placed  between  two  concentric  cylinders.  This  reduced  the  radiation  loss 
significantly,  and  a  larger  temperature  rise  was  observed  at  lower  powers.  A  detailed  analysis 
of  these  results  will  be  presented  later. 

Figures  4(a)  and  4(b)  present  the  temperature  rise  with  power.  As  can  be  seen  the 
temperature  increases  monotonically  with  the  power,  and  the  nature  of  the  curve  does  not 
change.  Indeed  the  curves  are  almost  parallel  and  support  our  earlier  observation  [14,15]  that 
the  melt  temperature  changes  almost  linearly  with  power.  In  Fig.  4(a),  the  thermocouple  #  2 
reads  the  largest  temperature  which  is  slightly  higher  than  that  at  1.  The  temperature, 
however,  drops  with  the  height,  moving  up  from  #  2  to  #  5.  This  can  be  easily  explained  by 
considering  the  heat  loss  from  the  susceptor.  Thermocouple  #5  is  in  a  position  where  the 
susceptor  loses  heat  directly  to  the  furnace  walls  and  other  components  by  radiation  whereas 
the  energy  in  the  middle  portion  of  the  susceptor  is  transferred  to  the  melt  only  by  conduction. 
The  temperature  in  the  lower  portion  of  the  susceptor  is  smaller  because  of  heat  loss  to  the 
bottom  portion  of  the  furnace.  Figure  4(b)  confirms  this  behavior  by  demonstrating  that  the 
temperature  is  lowest  for  #  7  and  largest  for  #  8.  (Unfortunately,  thermocouples  6  and  10 
stopped  working  soon  after  the  experiment  was  started.) 

Figure  5  demonstrates  the  effects  of  the  crucible's  location  (central  location  @2cm).  It 
is  observed  that  the  temperature  of  the  susceptor  generally  increases  as  it  is  moved  up  (Fig. 

5a).  The  only  exception  is  the  bottom,  thermocouple  ^1.  Figure  5(a)  further  shows  that  the 
temperature  at  the  top  of  the  susceptor  is  much  lower  than  that  at  its  bottom  when  the 
susceptor  is  at  its  bottom  most  location,  a  temperature  drop  of  600°C.  This  temperature 
variation  decreases  as  the  susceptor  is  moved  up,  less  than  200°C  at  the  topmost  location.  This 
does  not  mean  that  the  topmost  location  is  the  most  desirable  one.  As  shown  in  Fig.  5(b)  by 
thermocouple  #  7,  the  melt  temperature  may  be  lower  when  the  susceptor  is  at  its  top  location. 
It  seems  that  a  central  location  somewhere  between  2  and  3  cm  is  better  for  the  melt. 
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NUMERICAL  STUDY  OF  GAS  FLOW  IN  AN  HPCG  FURNACE 


As  reported  earlier  [14,15],  strong  convective  flows  are  induced  in  an  HPCG  furnace. 
The  flow  is  oscillatory  and  turbulent,  and  has  a  complex  pattern.  A  numerical  research 
program  was  therefore  initiated  to  study  the  gas  flow  phenomena  in  a  high  pressure  system. 
Governing  Equations.  The  model  used  here  to  analyze  the  gas  flow  assumes  that  the  flow  is 
incompressible  and  laminar  (turbulent  flows  may  be  considered  in  the  future),  the  Boussinesq 
approximation  is  applicable,  and  the  viscous  dissipation  is  negligible.  The  gas  flow  is  consid¬ 
ered  to  be  axisymmetric,  and  therefore,  only  two  dimensional  calculations  are  performed. 
With  these  assumptions,  the  non-dimensional  governing  equations  can  be  written  as  follows: 
Continuity; 


19  dw 


/^-Momentum: 


du  du  du  dp  \  d  du  d'^u  u 


Z-Momentum: 


du)  dw  dw  dp  I  d  .  dw  d'^w 


Energy: 


90  90  90  Wl  ^ 

9t  ^  97-  ^  9z  “  Fr  V7-97-^’’97'^‘^  9z2 


(1) 

(2) 

(3) 

(4) 


which  employ  the  following  scales: 

length:  b,  time:  b^/a  ,  velocity:  r/b,  and  pressure:  pv^lb^. 

Here,  the  dimensionless  velocities  in  radial  (r)  and  axial  (z)  directions  are  represented  by  u  and 
w,  respectively,  while  p  and  t  denotes  the  pressure  and  time.  The  dimensionless  temperature, 
0,  is  defined  as  (T  -  Tc)/(Th  -  Tc),  where  Tj,  and  Tq  are  the  encapsulant  surface  and  wall 
temperatures,  respectively.  Here,  p,  a,  and  are  kinematic  viscosity,  thermal  diffusivity  and 
isobaric  coefficient  of  thermal  expansion,  respectively.  The  crucible  radius,  b,  has  been  chosen 
as  the  primary  length  dimension.  The  governing  parameters  Gr  and  Pr  are  defined  as: 

Grashof  number,  Gr  =  gi8b3(Th  -  Tc)/»'2  (5) 

Prandtl  number,  Pr  =  p/a  (6) 
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The  flow  boundary  conditions  are  relatively  well-defined  since  the  gas  occupies  a 
specifically  prescribed  control  volume  with  rigid  boundaries  except  the  encapsulant  surface. 

For  present  calculations,  the  encapsulant  surface  has  also  been  assumed  to  be  rigid.  No-slip 
conditions  are  therefore  employed  on  all  boundaries.  However,  the  thermal  boundary 
conditions  are  not  as  accurately  known,  since  precise  conditions  within  the  real  systems  are  not 
known.  For  present  calculations,  we  consider  0  =  1  at  the  encapsulant  surface  while  the 
furnace  walls  are  assumed  to  be  at  0  =  0.  More  realistic  boundary  conditions  based  on 
temperature  measurements  may  be  employed  in  future. 

In  this  numerical  algorithm,  the  solid  region  within  the  computational  domain  is  intro¬ 
duced  by  imposing  an  extremely  large  artificial  viscosity  in  that  region  so  that  no  flow  can  take 
place  in  this  area.  Heat  conduction  is  allowed  to  take  place  through  the  partitions  walls/heat 
shields.  This  does  not  pose  any  difficulty  since  the  energy  equation  (4)  automatically  reduces 
to  the  conduction  equation  when  the  velocities  are  zero.  Note  that  the  thermophysical 
properties  can  be  allowed  to  vary  from  grid  to  grid  [16,17]. 

Numerical  Scheme.  The  governing  equations  (l)-(4)  are  discretized  using  the  control  volume 
integral  approach  and  a  staggered  grid.  It  is  Patankar’s  SIMPLER  method  which  is  used  to 
solve  the  equations  [18].  In  this  scheme,  each  term  is  integrated  over  its  volume  element.  In 
a  similar  manner,  the  continuity  equation  is  integrated  over  the  same  control  element,  then 
multiplied  by  the  dependent  variable  of  the  equation  being  discretized.  The  discretized  continu¬ 
ity  equation  is  eventually  subtracted  from  the  discretized  momentum  or  energy  equation.  This 
manipulation  enables  the  flow  field  to  always  satisfy  mass  conservation.  Two  other  equations, 
the  pressure  and  pressure-correction  equations  are  obtained  by  substituting  the  discretized 
momentum  equations  into  the  continuity  equation.  The  latter  has  the  task  of  correcting  the 
velocities  during  iterations,  while  the  former  improves  the  pressure  field.  The  SIMPLER 
method  is  based  on  a  semi-implicit  procedure.  In  the  present  algorithm,  the  convective  fluxes 
are  approximated  by  the  power  law  scheme  [16-18]  which  is  one  of  the  most  robust 
algorithms.  It  approximates  the  central  difference  when  velocities  are  small,  and  gradually 
mimics  the  upwind  difference  when  velocities  becomes  large. 

The  discretized  equations  are  underrelaxed  and  solved  iteratively,  within  each  time  step 
to  resolve  the  coupling  and  nonlinearity  of  the  flow  and  energy  equations,  using  a  line-by-line 
tri-diagonal  matrix  algorithm.  Sweeps  are  performed  in  both  forward  and  backward  directions 
so  that  the  effects  in  one  part  of  the  computational  domain  may  be  felt  throughout  the  entire 
domain  without  bias.  Convergence  at  each  time  step  is  declared  when  the  mass  source  and  the 
residuals  of  the  discretized  equations  are  less  than  a  specified  absolute  relative  tolerance.  Strict 
convergence  within  each  time  interval  is  crucial  for  the  accuracy  of  the  final  solutions, 
particularly  when  the  flow  is  oscillatory.  Since  the  temperature  gradients  are  generally  strong 
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in  the  vicinity  of  the  furnace  walls,  the  quartz  shields  and  the  encapsulant  surface,  a  suitable 
non-uniform  grid  distribution  is  used  for  the  present  computations.  Only  limited  trials  runs 
were  carried  out  to  optimize  the  grid  distribution,  convergence  criteria  and  time  step  since  our 
objective  here  is  to  obtain  some  qualitative  information  first  and  see  how  do  they  compare  with 
the  observed  flows. 

Numerical  Results.  Figures  6  and  7  present  the  flow  and  temperature  fields  in  half  of  the 
cylindrical  enclosure  with  a  partition  for  Gr  =  10^  and  5x10^,  respectively.  For  these  calcula¬ 
tions,  the  gas  properties  are  assumed  to  be  constant.  As  can  be  seen  in  Fig.  6a,  there  exist  two 
main  recirculating  cells,  one  on  top  of  the  encapsulant  (rotating  counter-clockwise)  and  the 
other  on  the  outside  of  the  partition  wall  extending  to  the  top  (rotating  clockwise).  The 
mechanism  of  formation  of  these  cells  is  straight-forward.  The  gas  is  heated  by  the  encapsualnt 
surface  and  moves  up  along  the  quartz  shield  since  the  cold  gas  flows  downward  along  the 
seed  rod  which  is  at  a  lower  temperature  than  the  B2O3  surface  (Fig.  1).  On  the  outside  of  the 
quartz  shield,  the  hot  gas  rises  along  its  surface  and  flows  downward  along  the  cold  furnace 
wall.  This  is  a  basic  flow  pattern  which  is  induced  by  the  thermal  conditions  in  this  HPCG 
furnace  and  agrees  with  the  visualization  reported  earlier  [14,15]. 

Figure  6(b)  also  shows  that  a  large  temperature  gradient  exists  near  the  B2O3  surface 
and  even  temperature  inversion  is  possible.  The  temperature  variations  in  the  region  outside  of 
the  quartz  shield  and  in  the  upper  portion  of  the  furnace  are  weak.  However,  a  strong  flow  is 
induced  in  this  region  primarily  because  of  the  shear  effects  produced  by  the  inner  cell,  and  a 
strong  buoyancy  force,  since  the  length  scale  is  very  large.  Note  that  the  Grashof  number  is 
proportional  to  the  third  power  of  the  length  (equation  5). 

Figure  7  (Gr  =  5x10"^)  demonstrates  the  effect  of  an  increase  in  the  buoyancy  force. 
Although  the  primary  flow  pattern  remains  the  same,  the  convective  flow  oscillates  at  this 
Grashof  number  (Fig.  7a).  Several  secondary  cells  whose  shapes  and  sizes  change  with  time 
are  produced  and  the  flow  becomes  extremely  complex.  This  supports  our  earlier  observation 
that  the  flow  in  an  HPCG  furnace  is  oscillatory  [14,15].  Note  that  this  Gr  is  much  lower  than 
the  real  value  encountered  in  an  HPCG  furnace,  and  at  higher  Grashof  numbers,  the  flow  can 
certainly  become  chaotic  and  turbulent.  Figure  7b  further  shows  that  with  an  increase  in 
Grashof  number,  the  plume  effect  has  been  extended  much  beyond  the  vicinity  of  the  B2O3 
surface  producing  large  temperature  gradients  in  other  regions  (compare  with  Fig.  6b). 

In  the  above  calculations,  the  presence  of  the  phosphorus  injector  inside  the  furnace 
was  not  considered,  as  well  as  the  thermophysical  properties  of  the  nitrogen  gas  were  assumed 
to  be  constant.  A  set  of  calculations  was  therefore  performed  to  examine  the  effects  of  these 
parameters.  The  phosphorus  injector  was  placed  at  its  topmost  location  which  simulates  the 
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Figure  6.  Predicted  temperature  and  flow  fields  in  right  half  of  the  enclosure  for  Gr  =  10^, 
the  effect  of  a  partition  wall,  (a)  flow  field  and  (b)  temperature  distribution. 


Figure  7a.  Predicted  flow  fields  in  right  half  of  the  enclosure  for  Gr  =  5x10"^, 
the  effect  of  a  partition  wall. 
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Figure  7b,  Predicted  temperature  field  in  right  half  of  the  enclosure  for  Gr  —  5x10^, 
the  effect  of  a  partition  wall. 
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crystal  growth  conditions,  and  the  gas  properties  were  allowed  to  vary  with  temperature.  These 
results  are  presented  in  Figs.  8-9  for  Gr  =  10*^,  5x10"^  and  5x10®,  respectively. 

In  comparison  to  Fig.  6a,  it  is  now  observed  that  several  cells  are  produced  in  the  melt 
region  at  Gr  =  10^  (Fig.  8a),  one  on  top  of  the  other  (underneath  the  P-injecto  '  .  This  is 
primarily  because  the  phosphorus  injector  restricts  the  flow  passage,  and  the  gas  can  go  out 
only  through  a  small  passage  between  the  injector  and  the  heat  shield.  The  flow  in  the  outer 
region  has  also  become  weaker  and  does  not  extend  to  the  top  portion  of  the  furnace  as  much 
as  it  did  earlier.  As  a  result  the  temperature  gradient  is  also  localized  in  the  melt  region  (Fig. 
8b). 

An  increase  in  Gr  again  complicates  the  flow,  and  two  large  cells  are  produced  in  the 
outer  region  (Fig.  9a).  These  two  convective  cells  have  resulted  from  the  hot  gases  coming  out 
through  the  gap  between  the  heat  shield  and  the  injector,  and  getting  cooled  on  the  furnace  side 
wall  and  top  as  wel’  s  by  the  seed  rod.  The  top  surface  and  seed  rod  provide  negative  buoy¬ 
ancy  for  the  upper  cell  while  the  side  wall  produces  a  second  cell  rotating  in  the  clockwise 
direction.  Several  secondary  cells  are  produced  when  the  Grashof  number  is  high  although  the 
basic  pattern  of  the  flow  remains  the  same.  The  flow  at  Gr  =  5x10®  is  oscillatory  (not  shown 
here).  It  seems  that  the  flow  restriction  has  reduced  the  buoyancy  effect  in  the  outer  region 
thereby  suppressing  the  oscillatory  behavior  to  some  extent.  Figure  9(b)  demonstrates  the 
complexity  of  the  temperature  fields  at  high  Grashof  numbers.  The  temperature  fields  reported 
in  Figs.  6(b)-9(b)  qualitatively  agree  with  the  temperature  measurements  reported  earlier. 

CONCLUDING  REMARKS 

In  order  to  develop  a  successful  process  for  one-step,  in-situ  synthesis  of  phosphorus 
vapor  and  indium  melt  and  growth  of  InP  crystals  of  large  sizes,  uniform  properties  and  at 
lesser  cost,  it  is  extremely  important  to  understand  the  mechanism  of  heat  transfer  and  gas 
flow  in  a  high  pressure  crystal  growth  system.  A  series  of  experiments  and  numerical  compu¬ 
tations  have  been  performed  to  study  the  thermal  conditions  in  an  HPCG  furnace.  Temperature 
distributions  have  been  measured  for  the  first  time  on  the  outer  surface  of  the  graphite 
susceptor  and  the  inner  surface  of  the  crucible.  The  effects  of  variation  in  power  supply  and 
the  vertical  location  of  the  crucible  have  also  been  examined.  The  numerical  results  demon¬ 
strate  interesting  flow  patterns  and  show  that  the  flow  becomes  oscillatory  at  high  Grashof 
number. 

Even  though  these  experiments  and  computations  provide  a  much  better  understanding 
of  the  HPCG  furnace  for  one-step,  in-situ  synthesis  and  growth  of  InP  crystals,  they  are  by  no 
means  complete.  Further  experiments  and  simulations  are  needed  to  characterize  the  heat 


13-18 


transfer  and  gas  flow  under  the  growth  conditions.  A  simultaneous  theoretical/numerical  and 
experimental  research  program  will  further  help  in  determining  the  optimized  conditions  for 
both  the  synthesis  and  the  growth,  and  in  suggesting  the  modifications  in  the  existing  system  in 
order  to  achieve  these  conditions. 
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CONDUCTED  INTERFERENCE  MEASUREMENT  RESULTS  FOR  A  GENERAL 
ELECTRIC  CORPORATION  SOFT  PART  ANALOGOUS  MODULE  (SPAM) 


John  P.  Rohrbaugh 

Electromagnetic  Environmental  Effects  Laboratory 
Georgia  Tech  Research  Institute 

Abstract 

Conducted  electromagnetic  interference  (EMI)  measurements  were  performed  on  a 
General  Electric  (GE)  Corporation  Soft  Part  Analogous  Module  (SPAM)  [1].  The  SPAM  is  a 
high  speed  1 -megabit  random  access  memory  (RAM)  that  was  built  by  GE  to  verify  multi -chip- 
module  (MCM)  fabrication  techniques  and  processes  prior  to  full  construction  of  radiation 
hardened  MCM’s.  Measurements  and  simulations  were  also  performed  on  the  individual 
integrated  circuits  (IC’s)  within  the  MCM  in  order  to  determine  if  simpler  and  less  expensive 
IC’s  could  be  evaluated  prior  to  testing  a  more  complex  and  costly  MCM.  This  report  discusses 
the  conducted  EMI  measurements  performed  on  the  GE  SPAM  and  the  SPAM  interface  IC’s 
while  the  simulation  results  are  discussed  in  Reference  2. 
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CONDUCTED  INTERFERENCE  MEASUREMENT  RESULTS  FOR  A  GENERAL 
ELECTRIC  CORPORATION  SOFT  PART  ANALOGOUS  MODULE  (SPAM) 


John  P.  Rohrbaugh 


1.0  INTRODUCTION 

Conducted  electromagnetic  (EM)  interference  measurements  were  performed  on  a  General 
Electric  Corporation  Soft  Part  Analogous  Module  (SPAM)  [1].  The  SPAM  is  a  high  speed  1- 
megabit  random  access  memory  (RAM)  that  was  built  by  GE  to  verify  multi-chip-module 
(MCM)  fabrication  techniques  and  processes  prior  to  full  construction  of  radiation  hardened 
MCM’s.  Measurements  and  simulations  were  also  performed  on  the  individual  IC’s  within  the 
MCM  in  order  to  determine  if  simpler  and  less  expensive  integrated  circuits  (IC’s)  could  be 
evaluated  prior  to  testing  a  more  complex  and  costly  MCM.  This  report  discusses  the 
measurements  performed  on  the  GE  SPAM  and  the  SPAM  interface  IC’s  while  the  simulation 
results  are  discussed  in  Reference  2, 

The  GE  SPAM  is  shown  in  Figure  1.  The  SPAM  is  illustrated  schematically  in  Figure 
2,  SPAM  package  pins  are  connected  to  Texas  Instruments  74S00  NAND,  74S138AN  3-to-8 
line  decoder,  74S373AN  D-flip-flop,  and  Performance  Semiconductor  P4C164  8kx8  static 
CMOS  RAM  integrated  circuits  (IC’s).  Only  input  data  and  control  pins  were  evaluated  on  this 
effort  and  not  -f-5  or  output  pins.  Continuous  wave  (CW)  single  frequency  interference 
measurements  were  performed  over  the  interference  frequency  range  of  10  MHz  to  2  GHz. 

2.0  Conducted  Interference  Test  Setup 

The  MCM  and  IC  conducted  Interference  test  setup  is  illustrated  in  Figure  3.  An 
HP8180A/8182A  digital  word  generator  and  analyzer  system  was  used  as  the  functional  signal 
source/analyzer.  The  HP8180A  is  a  digital  word  generator  having  sixteen  data,  one  strobe,  and 
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Figure  2.  Schematic  of  GE  SPAM. 


Figure  3.  Conducted  Interference  Test  Setup. 
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1-2QHZ  Anaran  1031(M  1-11  GHz  DQHC 


two  clock  outputs.  One  of  the  clock  outputs  was  used  as  the  external  clock  input  to  the 
HP8182A  digital  word  analyzer.  The  strobe  output  was  used  to  synchronize  the  HP8182A  to 
the  HP8180A  for  error  analysis.  The  HP8182A  was  used  to  determine  when  an  interference 
condition  occurred  by  looking  at  the  word  error  map  display.  When  a  word  error  occured,  the 
word  position  was  displayed  in  inverse  video  on  the  error  map.  The  HP8180A  output  levels 
were  set  at  2.4  volts  and  0.4  volts.  The  HP8182A  was  used  as  a  single-threshold  detector  with 
a  logic  0  to  1  transition  threshold  set  at  1.4  volts.  The  HP8182A  also  has  a  timing  diagram 
display  that  was  used  to  view  expected  data  timing  patterns.  An  oscilloscope  was  used  prior  to 
all  measurements  to  set  timing,  amplitude  levels  and  to  verify  interference  effects  and  thresholds. 

Various  combinations  of  signal  generators  and  amplifiers  were  used  to  perform  the 
conducted  interference  measurements  depending  upon  the  IC  under  test  and  the  interference 
frequency.  At  frequencies  less  than  100  MHz  (and  greater  than  100  MHz  when  performing 
measurements  on  the  SPAM  and  the  Performance  Semiconductor  RAM’s)  an  HP8640B  signal 
generator  was  used  as  the  interference  source.  The  output  level  and  frequency  of  the  HP8640B 
was  verified  to  be  accurate  using  independent  measurements  of  power  and  frequency  during 
calibration.  A  Mini-Circuits  RF  switch  was  used  to  gate  interference  signals  into  the  RAM  and 
SPAM  during  the  write  cycle  only  and  not  during  the  read  cycle.  Various  amplifiers  were  used 
after  the  RF  switch  if  the  interference  power  level  exceeded  the  RF  switch’s  1  db  compression 
pioint  of  18  dbm.  An  MCL  15022  RF  Power  Generator  was  used  at  frequencies  greater  than  1(K) 
MHz  when  testing  the  TI  interface  IC’s.  The  MCL  output  frequency  and  power  levels  were 
monitored  during  all  measurements.  Reflected  power  levels  were  recorded  when  the  reflected 
levels  were  greater  than  -30  dbm  (the  low  power  measurement  limit  of  the  power  meter  and 
detector  used  at  the  reflected  power  port  of  the  HP778D  dual  directional  coupler.)  In  order  to 
more  accurately  control  the  output  power  level  of  the  MCL  RF  Power  Generator  at  power  levels 
less  than  20  dbm,  a  10  or  20  db  attenuator  was  sometimes  used  at  the  generator  output. 

Various  digital  and  RF  signal  combiners  were  used  to  add  an  interference  signal  to  a 
digital  signal.  At  interference  signal  frequencies  below  100  MHz,  a  high  power  op  amp  circuit 
was  used  as  the  signal  combiner.  Various  dual  quadrature  hybrid  combiners  (DQHC)  were  used 
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to  add  the  digital  and  RF  signals  for  interference  frequencies  greater  than  100  MHz.  The  signal 
combiners  used  on  this  effort  were  designed  on  previous  GTRI  efforts  for  Rome  Laboratory  and 
are  described  in  detail  in  References  3  and  4.  The  op  amp  combiner  design  was  modified  on 
this  effort  with  a  schematic  of  the  modified  combiner  presented  in  Appendix  A. 

The  combined  signals  were  routed  to  an  IC  or  MCM  pin  using  a  50  ohm  micro-strip 
printed  circuit  (PC)  board.  The  micro-strip  PC  board  forms  part  of  a  Tester  Interface  Unit 
(TIU)  that  provides  32  controlled  impedance  paths  to  device  pins-under-test  up  to  frequencies 
of  12  GHz.  The  TIU  is  described  in  detail  in  Reference  3.  A  new  upper  aluminum  PC  board 
holder  and  micro-strip  PC  board  were  fabricated  on  this  effort  in  order  to  test  the  GE  SPAM. 
The  upper  and  lower  TIU  aluminum  PC  board  holder  drawings  shown  on  pages  C-5  and  C-6 
of  Reference  3  were  found  to  be  incorrect.  The  corrected  drawings  are  given  in  Appendix  B. 

3.0  Measurement  Results 

The  individual  IC  measurement  results  are  presented  in  Figure  4.  The  74S138AN  3-to-8 
line  decoder  G2A  and  G2B  pins  exhibited  an  unusual  susceptibility  profile  in  that  there  was  a 
20  db  increase  in  the  power  required  to  cause  interference  at  frequencies  greater  than 
approximately  100  MHz.  The  transition  region  was  measured  to  be  only  a  few  MHz  and  was 
very  repeatable  with  other  IC’s.  Simlations  (see  Reference  2)  verified  the  transition,  although 
the  exact  reason  for  the  sudden  increase  is  unknown. 

The  P4C164  RAM  exhibited  varying  susceptibility  thresholds  and  curves  for  various 
inputs  and  outputs.  The  most  susceptible  pin  was  the  CE2  enable  pin. 

The  interference  thresholds  for  the  GE  SPAM  are  shown  in  Figure  5.  Above 
approximately  300  MHz  the  MCM  connector  begins  to  resonate  with  power  being  reflected  at 
the  connector  interface  to  the  MCM.  The  interference  thresholds  do  not  compare  well  with  the 
individual  IC’s  above  approximately  200  MHz  because  of  the  MCM  connector. 
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Incident  Interference  Thresholds  for  74SOO 
2-lnput  NAND  Gate. 


Incident  Interference  Thresholds  for  74S13S.\N 
3-to-8  Line  Decoder. 


Frequency  (MHz) 

Incident  Interference  Thresholds  for  74S373AN 
D-Type  Flip-Flop. 


Frequency  (MHz) 

Incident  Interference  Thresholds  for  P4C164 
8k  by  8  Static  RAM. 


Figure  4.  Individual  IC  Susceptibility  Measurement  Results. 
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10  100  1000  10000 
Frequency  (MHz) 

Incident  Interference  Thresholds  for  General  Electric 
Soft  Part  Analogous  Module  (SPAM). 


Figure  5.  GE  SPAM  Interference  Measurement  Results. 
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4,0  Vulnerability  Assessment 


The  results  of  this  effort  indicate  that  the  the  GE  SPAM  module  is  susceptible  to  EM 
interference  but  not  whether  a  vulnerability  condition  could  exist.  To  perform  a  vulnerability 
assessment,  an  estimate  was  made  of  the  coupled  signal  levels  due  to  external  EM  threats. 
Radiated  field  levels  from  Reference  5  were  used  to  determine  if  the  modules  could  be 
vulnerable  to  upset  or  damage  due  to  EM  field  exposure.  It  was  assumed  that  each  pin  was 
connected  to  a  tuned  dipole  for  analysis  purposes.  Techniques  from  Reference  6  were  used  to 
calculate  coupled  signal  levels  given  the  threat  source  profile  in  Figure  6  (from  Reference  5)  and 
for  separation  distances  of  1  and  10  km.  Given  the  power  density  profile  in  Figure  6,  the  power 
received  at  an  IC  pin  can  be  calculated  using  the  following  equation, 


or  in  decibels  as 


2n 


201ogA.  -  7.98 


where, 

P,  =  recieved  power  in  dBm 

G,  =  receive  antenna  gain,  which  for  a  tuned  dipole  is  2.14  dBi 
Pd  =  incident  power  density  in  dBm  per  square  centimeter 
X  =  wavelength  in  centimeters 


Figure  7  overlays  the  GE  SPAM  susceptibility  profile  over  the  calculations  of  coupled 
signal  levels.  Coupling  to  any  of  the  pins  could  lead  to  vulnerability  over  the  measurement 
frequency  range  of  100  MHz  to  500  MHz  for  threat  sources  located  at  either  1  or  10  km. 
Additional  filtering,  limiting  and  shielding  of  up  to  60  dB  would  be  required  to  prevent 
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Frequency  (MHz) 


Figure  7.  GE  SPAM  Susceptibility  Thresholds  and  Calculated  Threat  Levels. 
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vulnerability.' 


5.0  Conclusions  and  Recommendations 

The  GE  SPAM  and  individual  IC  measurement  results  did  not  correlate  well  at 
frequencies  greater  than  200  MHz  because  of  reflections  that  occured  at  the  GE  SPAM  connector 
interface.  In  actual  practice  this  connector  would  not  be  used.  It  is  expected  that  the  GE  SPAM 
and  individual  IC  measurement  results  would  correlate  better  at  high  frequencies  if  the  interface 
connector  wer  not  used.  Even  with  the  interface  connector,  the  GE  SPAM  could  still  be 
vulnerable  to  EM  interference.  Care  must  be  taken  to  assure  system  vulnerability  does  not  occur 
by  adequately  shielding  and  filtering  the  interconnect  lines  to  the  GE  SPAM. 

Based  on  the  results  of  this  effort  and  previous  efforts  for  Rome  Laboratory  by  GTRI, 
it  is  recommended  that  a  Designer’s  Guide,  a  Designer’s  Handbook,  and  a  short  course  be 
prepared  for  use  by  Multi-Chip  Module  developers.  The  above  materials  should  discuss 
simulations,  measurements,  analyses,  trade-offs,  and  EM  mitigation  techniques  that  can  be  used 
prior  to  full-scale  development  of  an  MCM. 
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Appendix  A.  Signal  Combiners. 


The  schematic  of  the  op  amp  combiner  built  on  this  effort  is  shown  in  Figure  A-1.  An 
Elantec  EL-2009  buffer  is  used  at  the  output  of  a  Comlinear  CLC-220  op  amp. 


R1  R2  +12V  R12 


Figure  A-1.  Op  Amp  Combiner. 
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Appendix  B.  Tester  Interface  Unit  Aluminum  PC  Board  Holders. 


The  upper  and  lower  aluminum  PC  board  holder  drawings  for  the  Tester  Inteface  Unit 
are  incorrect  in  Reference  3.  Below  is  a  corrected  drawing  for  the  upper  PC  board  holder  that 
was  fabricated  on  this  effort. 


SI  DC  VICV-/  THPU  A -A 
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Abstract 

We  have  studied  the  near-handgap  second  order  optical  nonlinearities  of 
several  semiconductors  possessing  the  zincblende  structure.  Specifically,  we 
concentrated  on  difference-frequency  generation  (DFG)  and  sum-frequency 
generation  (SFG).  We  simultaneously  measured  the  DFG  radiation  field  and 
the  SFG  intensity  as  a  function  of  crystallographic  orientation  and  incident 
photon  energy.  To  perform  the  measurements,  we  select  <100>,  <110>,  and 
<11 1>  orientated  samples  of  GaAs,  InP  and  CdTe.  Herein,  the  pronounced 
near-bandgap  resonant  behaviors  of  both  DFG  and  SFG  are  compared  for  the 
first  time. 
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Second  Order  Optical  Nonlinearity  near  the  Bandgap  of  Zincblende  Crystals 


X.-C.  Zhang 


Introduction 

Recently,  several  dififerent  optoelectronic  techniques  have  been  developed  to 
generate  broadband  electromagnetic  pulses  in  free  space  by  utilizing  ultrafast  laser 
pulses.  These  techniques  include  the  use  of  high-speed  photoconductors  as  ultrafast 
current  sources  for  radiating  antennas  [1-5],  and  the  use  of  electro-optic  crystals  for 
THz  optical  rectification  media  [6-8].  Optical  rectification  is  a  second  order 
nonlinear  process.  Conventional  optical  rectification,  which  involves  a 
monochromatic  pump  beam  to  generate  a  dc  dielectric  polarization,  does  not  radiate 
electromagnetic  waves.  However,  if  the  pump  optical  beam  contains  an  ultrafast 
pulse  train,  the  different  components  within  the  frequency  broadening  of  the 
fundamental  beam  form  a  beating  polarization  [9].  This  time-varying  dielectric 
polarization  produces  a  transient  dipole  which  radiates  electromagnetic  waves  [10]. 
The  rectification  radiation  of  electromagnetic  pulses  with  a  frequency  range  from 
GHz  to  THz  has  been  demonstrated  [5].  THz  optical  rectification  radiation  is 
preferable  because  of  superior  pulse  duration  and  bandwidth.  To  date,  the  newly 
measured  THz  optical  rectification  of  nonlinear  organic  materials  provides  the  most 
intense  THz  radiation  field  among  all  of  the  natural,  non-extemally  biased 
materials  [11].  Besides  optical  rectification,  second  harmonic  generation  is  another 
second-order  process  which  has  been  well  studied  and  widely  used  in  many 
nonlinear  crystals.  However,  due  to  the  dispersion  of  second  order  susceptibility,  an 
electro-optic  material  which  produces  intense  second  harmonic  generation  may  not 
be  suitable  as  an  efficient  source  of  THz  radiation.  It  is  important  to  further  explore 
the  relationship  between  sum-frequency  generation  (SFG)  and  difference-frequency 
generation  (DFG)  processes  to  understand  the  second  order  nonlinearities  at  a 
wider  frequency  range.  During  the  AFOSR  Summer  Research  Program,  we  studied 
second  order  optical  nonlinearity:  difference-frequency  generation(DFG)  and  sum- 
frequency  generation  (SFG)  radiations  with  the  optical  excitation  near  the  bandgap 
of  zincblende  crystals  [12].  The  DFG  field  strength  and  the  SFG  intensity  from 
GaAs,  InP,  and  CdTe  were  measured  simultaneously  versus  crystallographic 
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orientation  and  incident  photon  energy  under  several  polarization  schemes. 
Further,  the  pronounced  near-bandgap  resonant  behavior  of  both  SFG  and  DFG 
radiations  are  compared  for  the  first  time. 


Experimental  Set  up 


Fig.  1  schematically  illustrates  the  experimental  arrangement  (side  view)  of 
an  SFG  and  DFG  spectroscope.  A  tunable  femtosecond  laser  beam  (a  mode-locked 
Ti:sapphire  laser)  illuminates  the  sample  at  normal  incidence.  The  optical  pump 
beam  is  focused  on  the  crystal  with  a  spot  diameter  of  35  |im.  During  the  photon 
energy  timing,  the  intensity  was  kept  constant  at  58  kW/cm^  . 


CO 


semiconductor 


Fig.  1:  Experimental  arrangement  for  the 
measurements  of  SFG  and  DFG. 


The  zincblende  crystals  are  semiinsulating  GaAs,  InP,  and  CdTe  wafers.  A  dielectric 
optical  beam  splitter  and  a  beam  splitter  using  a  thin  metallic  film  on  a  glass  plate 
are  used  to  reflect  the  SFG  light  and  the  DFG  field,  respectively.  The  backward 
radiated  SFG  and  DFG  signals  at  different  polarizations  (parallel  and 
perpendicular  to  the  polarization  of  the  optical  excitation  beam,  respectively)  are 
measured  simultaneously.  The  peak  value  of  the  time-integrated  SFG  signal  (light 
intensity)  is  detected  by  a  photomultiplier  attached  to  a  one-meter  spectrometer, 
and  the  peak  value  of  the  DFG  signal  (electric  field)  is  detected  by  a  picosecond 
photoconducting  antenna.  During  the  measurement  of  energy  dispersion,  which 
requires  laser  energy  tuning,  the  spectrometer  is  scanned  to  find  the  peak  value  of 
the  SFG  signal.  To  reduce  experimental  error  generated  by  the  random  fluctuation 
of  optical  power  and  temporal  pulse  duration  of  the  fundamental  beam,  a  quartz 
crystal  (z-cut)  is  used  as  a  reference  sample  to  normalize  the  nonlinear  radiation 
signals.  The  measurement  is  performed  at  room  temperature. 
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Calculation 


When  a  femtosecond  laser  pulse  illuminates  a  <111>  zincblende  crystal  with 
normal  incidence,  equations  for  DFG  polarization,  P(Q),  and  SFG  polarization, 
P(2o)+G),  can  be  derived.  They  are 


Pl  I  (G)  =^di4(G;  co+G,  -co)  sin(30)E(a>+G)E*(co) 

(1) 

Pl(G)  =  ^di4(G;  co-i-G,  -co)  cos(30)E(a)-i-G)E*(co) 

(2) 

Pi  I  (2a)-hG)  =-^di4(2co+G;  om-G,  co)  sin(30)E(co-i-G)E(co) 

V6 

(3) 

4 

Pl(2co-i-G)  =  -7=di4(2co-i-G;  co-i-G,  co)  cos(30)E(co-f-G)E(co) 

Ve 

(4) 

where  P|  |  and  Pi  are  dielectric  polarizations  parallel  and  perpendicular  to  the 
pump  beam,  respectively,  and  di4  is  a  susceptibility  tensor  element.  The  azimuthal 
angle  0  is  measured  with  respect  to  the  <l,l,-2>  direction.  The  SFG  beam  intensity 
is  proportional  to  the  square  of  the  dielectric  polarization  (I(2o))  oc  P(2(o)2 ),  and  the 
DFG  field  strength  Er  is  proportional  to  the  dielectric  polarization  (Er(Q)o=  P(Q)). 
The  measured  Er  (t)  in  time  is  the  Fo;uier  transform  of  Er(n)  over  the  frequency 
spectrmn  Aco  where  Aco  is  comparable  to  the  spectrum  broadening  of  the  optical 
piunp  pulse. 

For  <110>  oriented  samples,  two  equations  for  DFG  polarization,  P(Q)  are; 

Pi  I  (G)  =— di4(G;  (o+fl,  -0))  [cos(30)-cos(0)]E(o)+G)E*(co)  (5) 

4 

P 1  (G)  =  -di4(G;  co+G,  -oj)  [3cos(30)+cos(0)]E(co+G)E*(co)  (6) 

4 

Experimental  Results 

Fig.  2  and  Fig.  3  plot  the  peak  values  of  second  harmonic  signal  (intensity) 
and  THz  beam  (field  strength)  versus  0,  with  the  polarization  of  the  detected  signal 
perpendicular  (Fig.  2)  and  parallel  (Fig.  3)  to  that  of  the  pump  light  (fundamental 
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beam),  respectively.  The  dots  are  the  experimental  data  and  the  curves  are 
normalized  plots  from  the  calculations  of  Er(^i)  and  I(2o)). 


0  90  180  270  360  0  90  180  270  360 

Azimuthal  Angle  (Degrees)  Azimuthal  Angle  (Degrees) 


Fig.  2:  Peak  values  of  SH  intensity  (solid  dots)  Fig.  3:  Same  as  Fig.  2  except  the  polarization 
and  THz  field  (open  dots)  vs  azimuthal  of  the  nonlinear  signals  is  detected 

angle  for  <111>  GaAs;  the  polarization  parallel  to  the  fundamental  light.  The 

of  the  nonlinear  signals  is  detected  solid  line  is  calculated  from  Equations 

perpendicular  to  the  fundamental  (1)  and  (3). 

light.  The  solid  line  is  calculated  from 
Equations  (1)  and  (3). 


0  90  180  270  360  0  90  180  270  360 

Azimuthal  Angle  (degrees)  Azimuthal  Angle  (degrees) 

Fig.  4  Peak  values  of  THz  field  (open  dots)  vs  Fig.  5  Same  as  Fig.  4  except  the  polarization  of  the 
azimuthal  angle  for  <110>  CdTe;  the  nonlinear  signals  is  detected  perpendicular 

polarization  of  the  nonlinear  signals  is  to  the  fundamental  light.  The  solid  line  is 

detected  parallel  to  the  fundamental  calculated  from  Equations  (6). 

light.  The  solid  line  is  calculated  from 
Equations  (5). 

As  we  expected  from  the  calculations,  the  THz  field  shows  a  three-fold  rotation 
symmetry  (sin(30)  and  -cos(30)),  and  the  second  harmonic  intensity  shows  a  six-fold 
rotation  symmetry  (l+cos(60)).  A  similar  angular  dependence  was  observed  in  a 
<111>  oriented  CdTe  sample.  We  also  measured  the  THz  field  from  a  <110> 
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oriented  zincblende  crystals  (GaAs,  InP  and  CdTe)  with  near  bandgap  optical 
excitation.  The  THz  signals,  plotted  in  Fig.  4  and  Fig.  5,  showed  Pn  (Q)oc  cos(30)  - 
cos(30)  and  Pl(G)  oc  cos(0)  +  3cos(30)  measured  from  <00- 1>  direction,  consistent 
with  calculations  for  second  order  nonhnearity  in  a  <110>  crystal  [13].  Excellent 
agreement  between  the  experimental  data  and  the  calculations  from  second-order 
nonlinearity  has  thus  been  foimd  for  both  <111>  and  <110>  oriented  zincblende 
crystals. 

It  should  be  noted  that  when  the  focusing  spot  of  the  fundamental  beam  is 
comparable  with  the  wavelength  of  the  DFG  radiation  (nearly  one  millimeter),  the 
plane  wave  approximation  is  not  valid.  A  simple  estimate  shows  that  at  the  tightest 
focusing  point  (a  few  |jm),  the  SFG  intensity  reaches  a  maximum  while  the  strength 
of  the  DFG  field  drops  to  a  minimum.  We  have  confirmed  this  experimentally.  This 
consideration  does  not  affect  qualitative  comparisons  between  DFG  and  SFG  in 
crystal  orientation  and  energy  dispersion  measurements. 

We  observed  resonance  enhancement  and  sign  reversal  of  the  nonlinear 
susceptibility  near  the  bandgap  of  zincblende  crystals  by  using  a  THz  optical 
rectification  technique  [8].  When  the  laser  photon  energy  was  tuned  70  meV  lower 
than  the  GaAs  bandgap,  the  rectified  DFG  signal  was  nearly  unchanged  for  small 
energy  tuning.  When  the  laser  was  timed  to  the  bandgap,  the  amplitude  of  the  DFG 
signal  increased,  reaching  a  peak  when  the  laser  energy  was  just  above  the 
bandgap.  Further  tuning  of  the  laser  energy  to  the  higher  energy  side  results  in  a 
polarity  change  of  the  radiation  signal.  This  dramatic  change  of  THz  radiation  in 
<111>  GaAs  is  due  to  the  resonant  nonlinear  susceptibility.  The  resonance 
enhancement  near  the  electronic  resonance  states  has  been  confirmed  in  other 
samples.  Fig.  6  shows  the  DFG  signal  from  a  <111>  InGaAs/GaAs  superlattice 
sample  versus  optical  excitation  energy  at  normal  incidence.  In  addition  to  the 
resonance  due  to  GaAs  (barriers)  at  1.45  eV,  an  additional  resonance,  which  is  due 
to  the  bound  states  in  the  InGaAs  quantum  wells,  is  observed  at  1.40  eV.  The  origin 
of  a  small  feature  at  1.53  eV  is  not  clear  yet.  During  the  laser  energy  timing,  the 
temporal  duration  of  the  rectified  field  remained  essentially  unchanged.  We  also 
observed  similar  resonant  behavior  from  <111>  oriented  CdTe  samples.  Both  the 
resonance  enhancement  and  sign  reversal  near  the  bandgap  of  CdTe  (Eg  is  about 
1.50  eV)  are  observed,  as  shown  in  Fig.  7.  The  strong  radiation  signal  around  1.4  eV 
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may  be  due  to  the  impurity  states  in  CdTe.  Photoluminescence  measurement  has 
shown  intense  emission  from  impurity  states  in  this  CdTe  crystal. 


1.35  1.40  1.45  1.50  1.55  1.35  1.40  1.45  1.50  1.55 


Photon  Energy  (eV)  Photon  Energy  (eV) 

Fig.  6  Peak  value  of  the  DFG  signal  (dots)  vs  Fig.  7;  Peak  value  of  the  DFG  signal  vs  laser 
laser  energy  from  a  <111>  CdTe.  Solid  energy  from  a  <111>  InGaAs/GaAs 

line  is  guided  for  eye.  superlattice. 
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Fig.  6  Peak  value  of  the  electromagnetic  Fig.  7  Peak  values  of  the  integrated  SFG  light 

radiation  signal  vs  laser  energy  from  a  intensity  and  the  DFG  field  strength  from 

biased  <100>  GaAs.  a  GaAs  sample  versus  the  fundamental 

photon  energy. 

We  have  also  calculated  the  nonlinear  polarization  from  a  perfect  <100> 
oriented  GaAs  bulk  sample.  The  calculation  showed,  regardless  of  the  azimuthal 
rotation  of  the  crystal,  no  DFG  signal  in  the  forward  direction  under  normal 
incidence.  This  calculated  result  was  confirm:  1  by  our  experiment.  Since  <100> 
oriented  GaAs  can  not  emit  a  forward  THz  beam,  we  fabricated  a  pair  of  parallel 
planar  electrodes  on  a  <100>  GaAs  wafer  to  apply  an  external  bias.  The  forward 
radiation  from  a  biased  <100>  GaAs  sample  (pure  transient  photocurrent)  versus 
photon  energy  is  plotted  in  Fig.  6.  This  is  a  good  comparison  of  the  forward 
radiation  driven  by  the  transient  photocurrent  in  the  biased  external  field  and  the 


I  I  t  /  \  t\ 

Photon  Energy  (eV)  Photon  Energy  (eV) 
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DFG  (a.u.) 


forward  radiation  generated  by  ultrafast  optical  rectification.  Clearly,  no  sign 
reversal  is  measured  from  the  biased  <100>  GaAs  in  Fig.  6. 

From  an  anharmonic  oscillator  model,  when  the  fundamental  photon  energy 
Ji(o  is  close  to  bandgap  Eg,  the  singularities  in  the  principal  value  of  the  integral  of 
susceptibility  provide  resonance  behavior  for  both  SFG  and  DFG.  Fig.  7  is  a  plot  of 
peak  values  of  the  time-integrated  SFG  intensity  and  DFG  field  from  a  <111>  GaAs 
sample  (the  same  sample  we  used  for  azimuthal  angle  measurements)  versus  the 
fimdamental  photon  energy.  The  dispersive  feature  near  the  bandgap  Eg  =  1.42  eV 
is  clear  for  both  second-order  signals.  However,  unhke  the  optical  rectification 
signal,  the  second  harmonic  signal  does  not  show  an  expected  minimum  at  the 
position  where  the  DFG  signal  crosses  the  zero-line;  instead,  it  decreases  as  the 
photon  energy  is  timed  above  the  bandgap.  This  second  harmonic  dispersion  of 
GaAs  near  the  bandgap  was  observed  previously  [14].  The  second  harmonic 
nonlinearity  is  purely  electronic  since  its  frequency  is  well  above  the  lattice  mode 
frequencies.  The  optical  rectification  nonlinearity,  on  the  other  hand,  is  ionic  as  well 
as  electronic,  since  the  frequency  lies  below  the  optical  lattice  mode  frequencies. 
Currently  the  discrepancy  of  the  dispersion  curves  between  the  SFG  and  DFG 
signals  is  not  fully  xmderstood.  One  possible  reason  may  be  the  different  absorption 
behaviors  of  SFG  and  DFG  signals  in  the  GaAs  crystal. 

Conclusion 

We  report  the  experimental  results  of  sum-frequency  generation  (SFG)  and 
difference-frequency  generation  (DFG)  from  <100>,  <110>  and  <111>  oriented 
zincblende  crystals.  The  simultaneous  measurement  of  near-bandgap  resonant  SFG 
intensity  and  DFG  field  versus  crystallographic  orientation  and  incident  photon 
energy  are  presented  for  the  first  time. 
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DESIGN  OF  AN  INTERFACE  BETWEEN  THE  GENERIC 
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ABSTRACT 

The  goal  of  this  document  is  to  specify  the  design  of  an  interface  between  a  data 
extraction  system  called  the  Generic  Intelligence  Processor  (GIP)  and  a  natural 
language  understanding  (NLU)  system  called  the  Probabilistic  Language  Understanding 
Model  (PLUM).  The  interface  will  provide  a  mechanism  for  processing  natural 
language  fragments  through  PLUM  in  the  GIP  environment.  The  central  design 
strategy  for  this  interface  is  to  make  little  or  no  changes  to  the  existing  systems, 
namely,  GIP  and  PLUM. 
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DESIGN  OF  AN  INTERFACE  BETWEEN  THE  GENERIC 
INTELLIGENCE  PROCESSOR  (GIP) 

AND  A  NATURAL  LANGUAGE  UNDERSTANDING  SYSTEM 


Pradip  Peter  Dey 


1.  INTRODUCTION 


The  goal  of  this  document  is  to  specify  the  design  of  an  interface  between  a  data 
extraction  system  called  the  Generic  Intelligence  Processor  (GIP)  and  an  NLU  system 
called  PLUM.  The  interface  will  provide  a  mechanism  for  processing  natural  language 
fragments  through  PLUM  in  the  GIP  environment.  It  is  desirable  that  an  appropriate 
version  of  PLUM  is  selected  for  this  interface  since  several  versions  of  PLUM  are 
available  and  not  all  versions  are  equally  suited  for  this  task.  GIP  has  two  distinct 
functional  roles:  as  an  application  builder,  and  as  a  run-time  message  processing 
application.  One  of  them  have  to  be  selected  for  this  interface.  There  are  several  other 
design  decisions  that  have  to  be  made  during  the  design  process. 

When  used  with  NLU  components,  GIP  would  be  a  complicated  system  requiring 
enormous  computing  power.  Efficient  execution  of  GIP  is  possible  only  by  clever 
utilization  of  computing  resources  around  the  GIP  environment.  In  order  to  achieve 
this,  parallel  and  distributed  execution  of  GIP  modules  wiU  be  necessary.  With  this 
point  in  mind  the  proposed  interface  will  execute  PLUM  on  a  remote  site. 


The  general  approach  to  the  design  and  development  of  the  proposed  interface 
system  is  rapid  prototyping  (Pressman  1992,  Boar  1984,  Cieslak  et  al.  1989).  This 
approach  has  many  advantages  for  developing  a  large  class  of  software  (Luqi  1989). 
Boehm,  Gary  and  Seewaldt  (1984)  found  that  software  products  developed  using 
prototyping  performed  as  well  as  those  developed  using  other  traditional  techniques. 
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In  addition,  45%  less  effort  was  expended  and  40%  fewer  lines  of  code  were  generated 
by  the  developers  who  used  the  prototyping  paradigm. 

The  specific  design  method  followed  in  this  document  is  known  as  the  Data-Flow 
oriented  design  (Pressman  1992). 


2.  GENERIC  INTELLIGENCE  PROCESSOR  (GIP) 

GIP  and  PLUM  are  not  described  in  this  document  except  for  some  key  elements 
that  are  directly  referenced  by  the  proposed  interface.  A  detailed  description  of  GIP  is 
available  in  a  Software  Design  Document  written  by  the  Sterling  IMD,  Inc.  (Sterling 
1993).  The  main  goal  of  GIP  is  to  extract  information  from  formatted  and  unformatted 
messages  and  transmit  the  results  to  downstream  databases  and  processes  such  as 
expert  systems.  GIP  is  a  toolkit  for  developing,  testing,  and  delivering  message 
processing  applications.  When  fully  developed  GIP  will  provide  the  following 
capabilities  (a)  user  selectable  input  sources,  (b)  automatic  message  parsing,  (c) 
automatic/user  assisted  information  extraction,  (d)  automatic  output  formatting  for 
development  systems,  (e)  utilities  for  developing  new  algorithms.  For  processing 
messages  with  substantial  natural  language  texts  user  assistance  will  be  crucial  for 
GIP,  because,  natural  language  processing  technology  is  not  mature  enough  for  fully 
automated  information  extraction  in  an  operational  environment.  However,  for  the 
demonstration  interface  the  user  assistance  facility  has  been  suppressed  in  order  to 
achieve  appropriate  speed-up.  The  GIP  program  components  are  reproduced  in 
Figure- 1. 
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FactbaM 


Figure- 1:  GIF  Program  Components  (From  Sterling  1993) 

The  most  crucial  GIP  module  for  the  interface  is  the  UPARSE  component.  UP  ARSE 
allows  manual  data  extraction  as  well  as  automatic  data  extraction  through  NLU 
systems.  A  candidate  NLU  system  is  PLUM.  Since  PLUM  does  not  meet  all  the 
requirements  that  GIP  imposes  on  a  NLU  system,  this  interface  is  designed  to  fill  the 
gap  between  the  two.  A  natural  strategy  for  this  design  would  therefore  be  to  meet  all 
the  requirements  through  the  interface  that  are  imposed  by  GIP  but  lacking  in  PLUM. 
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3.  PLUM 


PLUM  is  a  state-of-the-art  NLU  system  for  data  extraction  from  NL  texts.  It  is 
intended  to  be  robust  and  scalable.  Besides  lexical,  morphological,  syntactic,  and 
semantic  components  PLUM  has  a  statistical  learning  module  that  enhances  its 
robustness  and  scalability.  A  detailed  description  of  PLUM  is  available  in  an  article 
entitled  "BBN  PLUM;  MUC-4  Test  Results  and  Analysis"  (Weischedel  et  al.  1992, 
Weischedel  et  al.  1993)  and  some  other  BBN  reports  (BBN  et  al.  1993).  The  MUC-4 
version  of  PLUM  is  selected  for  this  interface  for  the  reasons  described  in  Section-5.1. 
This  version  can  be  run  in  the  client-server  mode  as  described  at  the  end  of  this 
section. 


Since  we  do  not  int^  • '  to  modify  PLUM,  its  internal  structure  is  not  important  for 
our  purpose.  Input  to  Fi_  -JM  is  a  text-file  containing  a  free  text  message  like  the  one 
given  in  Figure-2.  The  output  from  PLUM  is  a  text-file  containing  a  set  of  templates 
like  the  one  given  in  Figure-3.  (These  examples  are  taken  from  PLUM  environment, 
although  they  appear  to  come  from  MUC-3.)  PLUM  tries  to  fill  as  many  fields  as 

possible.  When  it  cannot  fill  a  field  it  marks  it  with  a  ,  following  the  MUC 
convention. 

DEV-MUC3-0016  (NOSC) 

BUENOS  AIRES,  10  JAN  90  ;  >A)  -  [TEXT|  A  UCR  [RADICAL  CIVIC  UNION] 
BUENOS  AIRES  PROVINCE  COMMITTEE  SPOKESM/.N  TODAY  REPORTED  THAT  A 
UCR  DISTRICT  HEADQUARTERS  IN  BUENOS  AIRES  PROVINCE  WAS 
"COMPLETELY  DESTROYED  BY  A  BOMB  EXPLOSION."  THE  DESTROYED  UCR 
HEADQUARTERS  IS  IN  THE  MORENO  DISTRICT  OF  BUENOS  AIRES. 

THE  SPOKESMAN  ADDED  THAT  NO  ONE  WAS  INJURED  AND  THAT  LEOPOLDO 
MOREAU,  THE  UCR  BUENOS  AIRES  PROVINCE  LEADER,  HAS  TRAVELED  TO 
MORENO. 

IT  IS  EXPECTED  THAT  UCR  BUENOS  AIRES  PROVINCE  BOARD  WILL  MEET  AND 
THAT  THEY  WILL  ASK  THE  PROVINCIAL  GOVERNMENT  AND  THE  EXTERIOR 
MINISTRY  TO  INVESTIGATE  THE  INCIDENT. 


Figure-2:  A  MUC-3  Message 
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0.  MESSAGE:  ID  DEV-MUC3-0016  (NOSC) 

1.  MESSAGE:  TEMPLATE  1 

2.  INCIDENT:  DATE  10  JAN  90 

3.  INCIDENT:  LOCATION  ARGENTINA:  BUENOS  AIRES  PROVINCE  (PROVINCE) 

4.  INCIDENT:  TYPE  BOMBING 

5.  INCIDENT:  STAGE  OF  EXECUTION  ACCOMPLISHED 

6.  INCIDENT:  INSTRUMENT  ID 

7.  INCIDENT:  INSTRUMENT  TYPE 

8.  PERP:  INCIDENT  CATEGORY 

9.  PERP:  INDIVIDUAL  ID 

10.  PERP:  ORGANIZATION  ID 

11.  PERP:  ORGANIZATION  CONFIDENCE  - 

12.  PHYS  TGT:  ID  "A  UCR  DISTRICT  HEADQUARTERS" 

13.  PHYS  TGT:  TYPE  ORGANIZATION  OFFICE:  "A  UCR  DISTRICT  HEADQUARTERS 

14.  PHYS  TGT:  NUMBER  1:  "A  UCR  DISTRICT  HEADQUARTERS" 

15.  PHYS  TGT:  FOREIGN  NATION 

16.  PHYS  TGT:  EFFECT  OF  INCIDENT  DESTROYED:  "A  UCR  DISTRICT  HEADQUARTERS" 

17.  PHYS  TGT:  TOTAL  NUMBER 

18.  HUM  TGT:  NAME 

19.  HUM  TGT:  DESCRIPTION 

20.  HUM  TGT:  TYPE 

21.  HUM  TGT:  NUMBER 

22.  HUM  TGT:  FOREIGN  NATION 

23.  HUM  TGT:  EFFECT  OF  INCIDENT 

24.  HUM  TGT:  TOTAL  NUMBER 

Figure-3:  The  TEMPLATE  Produced  by  PLUM  from  the  Message  of  Figure-2 

The  MUC-4  version  of  PLUM  can  be  run  in  several  modes;  the  most  appropriate  for 
this  interface  being  the  Client-Server  mode.  It  reads  a  single  message  from  an  input- 
file  named  .to_plum  and  writes  output  to  .from_plum.  Multiple  messages  are  processed 
by  copying  messages  from  /home/plum/TIPSTER/msg  to  .to_plum  and  copying  out-put 
from  from  .from_plum  to  the  same  directory.  The  PLUM  server  is  currently  activated 
by  performing  the  following  3  actions  in  the  given  order: 

(1)  giving  the  command  "run-muc"  from  UNIX;  this  will  start  LISP, 

(2)  loading  a  file  into  LISP  by  "(load  "demo-server-patch.lisp")" 

(3)  executing  "(file-message-server)"  in  LISP. 
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This  sequence  of  3  actions  makes  the  server  ready  for  processing  messages.  When 
the  server  is  ready,  the  client  process  is  activated  by  the  command  "serv_test"  from  a 
UNIX  shell.  Once  the  client  establishes  connection  with  the  server  the  messages  are 
taken  one  by  one  from  /home/plum/l’lPSTER/msg  directory  for  processing.  Under 
this  directory  input  messages  are  in  msg-inX  files  and  after  each  message  is 
processed  the  output  is  written  to  a  corresponding  msg-outX  file  in  the  same  directory. 
Currently  serv_test.c  is  written  in  such  a  way  that  maximum  of  5  messages  are 
processed  in  a  session;  after  that  the  client  and  the  server  continue  to  run  without 
processing  any  messages.  This  can  be  easily  changed  by  modifying  serv_test.c.  The 
client  and  the  server  can  be  stopped  by  a  signal  such  as  Ctrl-Z. 


4.  REQUIREMENTS  ANALYSIS 

The  software  design  presupposes  software  requirements  analyses.  The  requirements 
that  GIP  imposes  on  its  NLU  components  are  described  in  Sterling  (1993;  3-5).  GEP 
is  supposed  to  interface  to  NLU  components  via  files  stored  in  a  Network  File  System 
(NFS)  mounted  file  system  as  well  as  through  interprocess  communication  messages 
sent  via  the  Event  Manager.  "All  input  data  for  the  NLU  will  be  stored  in  a  specified 
NFS  mounted  file  system  which  the  NLU  has  access  to.  This  data  includes  the  free 
text  from  the  messages,  the  templates  available  for  filling  by  the  NLU,  and  any 
formatted  information  that  has  been  extracted  from  the  message,  such  as  message 
header  data.  GIP  will  then  notify  the  NLU  application  of  the  input  data  location  via  a 
message  posted  with  Event  Manager.  Templates  filled  by  the  NLU  are  likewise 
written  to  an  NFS  mounted  file  system  which  can  be  accessed  by  GIP.  The  NLU  will 
post  a  message  for  GIP  via  the  Event  manager  that  contains  the  location  of  NLU 
processing  results..."  (Sterling  1993:  3-5).  Figure-4,  taken  from  (Sterling  1993) 
illustrates  the  requirements  that  GIP  imposes  on  the  NLU. 
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Figure-4:  NLU  Interface  (From  Sterling  1993) 

Flow  of  information  is  an  important  aspect  of  data-flow  oriented  analysis.  A 
Level-0  data-flow  diagram  given  in  Figure-5  presents  the  flow  of  information  in  the 
proposed  interface  system  (Pressman  1992).  Input  to  PLUM  comes  from  the  UPARSE 
component  of  GIP.  Normally,  free  text  messages  and  their  associated  blank  templates 
would  be  input  to  PLUM.  However,  there  is  really  no  need  to  transmit  the  blank 
templates  from  GIP  to  PLUM  for  this  interface  version.  Because  these  templates  are 
already  available  to  PLUM  in  the  expected  form.  We  have  chosen  a  version  of  PLUM 
that  deals  only  with  MUC-4  templates.  As  long  as  the  same  set  of  templates  are  used 
by  GIP  for  this  type  of  messages  there  would  not  be  any  problem.  The  output  from 
PLUM  has  to  go  back  to  the  UPARSE  component  of  GIP  with  some  modifications. 


16-9 


These  modifications  must  be  made  by  the  interface  system. 


Figure-5:  LeveI-0  Data-Flow  Diagram 
Certain  other  requirements  of  the  proposed  interface  are  given  below: 

(1)  The  interface  must  provide  a  mechanism  for  execution  of  PLUM  as  a  logical 
component  of  GIP(UPARSE)  that  can  be  used  for  at  least  one  type  of  messages. 

(2)  PLUM  should  be  executed  on  a  remote  site  for  efficiency  considerations. 

(3)  The  interface  should  not  place  any  extra  burden  on  the  user. 


5.  INTERFACE  DESIGN 

The  main  strategy  for  this  design  is  to  make  little  or  no  changes  to  the  existing 
systems,  namely,  GIP  and  PLUM.  The  interface  should  handle  each  transformation  of 
data  required  by  one  system  from  the  other. 
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5.1  Design  Decisions 


For  any  design  usually  a  number  of  design  decisions  have  to  be  made.  The  following 
design  decisions  are  made  with  the  justifications  stated  therewith. 

1.  Select  a  version  of  PLUM:  The  MUC-4  version  of  PLUM  is  selected  because  it  is 
recommended  by  BBN  on  the  basis  of  its  performance.  This  version  works  well  in  the 
client-server  mode  bypassing  the  x-window  based  user  interface.  The  client-server  is 
important  for  our  piupose,  because  we  want  to  run  the  server  on  one  machine  and  the 
client  on  another  machine.  More  specifically,  we  want  to  activate  the  client  process 
from  the  interface  system.  This  version  is  currently  activated  by  run-muc  and  loading 
demo-server-patch.Usp  into  LISP  as  described  in  Section-3. 

2.  Select  a  mode  from  GAB  and  MPA:  GAB  is  selected  because  an  MPA  is 
indirectly  generated  from  GAB  whereas  GAB  is  directly  available  through  compilation 
of  source  codes. 

3.  Select  a  communication  media  from  Event  Manager,  Cronus,  remote  copy  and  NFS 
mounting  and  a  mixture  of  these:  A  mixture  is  selected  because  Event  Manager  is 
used  by  GIP  and  Cronus  is  used  by  PLUM  and  these  can  not  be  easily  eliminated  or 
substituted  for  the  other.  NFS  mounting  of  NLP_INTERFACE_DIR  is  required  by  the 
UP  ARSE  component  of  GIP. 

4.  Select  a  Start-up  sequence:  The  PLUM  processes  can  be  started  either  from 
UPARSE_CONTROLLER  or  from  the  interface  system  or  independently.  PLUM 
should  be  started  independently  for  two  main  reasons:  (1)  That’s  the  easiest  way. 
That’s  the  way  PLUM  works  now.  (2)  PLUM  should  not  be  started  from 
UPARSE_CONTROLLER,  so  that  existing  GIP  operations  are  least  likely  to  be  altered 
in  the  processes  of  experimental  development  process. 
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5.2  Procedural  Design 


In  the  GIP  environment,  UPAJtSE_CONTROLLER  usually  creates  NLU 
Processes.  However,  a  better  strategy  for  this  interface  is  to  start  PLUM 
independently.  In  any  case,  PLUM  processes  should  be  able  to  communicate  with  the 
two  UPARSE  processes,  namely,  nlu_writer  and  nlu_reader.  Since  PLUM  will  be 
hard  to  modify,  this  interface  system  will  mediate  the  communication.  As  soon  as  the 
interface  process  is  created  it  should  be  registered  with  the  Event  Manager  in  order  to 
receive  messages  from  UPARSE.  This  can  be  done  by  sending  an  ACK  message  to 
UPARSE_CONTROLLER.  The  ACK  message  contains  the  standard  GIP  message 
header  fields,  followed  by  the  tag  field  ACK!!!.  The  format  of  the  ACK  message  is  as 
follows  (Sterling  1993;  4-14); 


[[SOURCE  NAME]] 
[[HOST  MACHINE]] 
[[MESSAGE  NAME]] 
[[ACK!!!]] 


«Category  of  process  sending  the  message. » 

«Name  of  the  machine  on  which  the  process  is  running.» 
«Name  of  message  being  built.» 

«Name  of  process  publishing  this  message.» 


The  nlu_writer  is  normally  responsible  for  assigning  messages  to  NLU  processes 
and  load  balancing  (Sterling  1993;4-37).  However,  it  would  be  hard  to  modify  PLUM 
so  that  it  receives  messages  directly  from  nlu_writer.  At  this  stage,  it  would  be  easier 
to  assign  messages  to  PLUM  and  balance  the  load  from  the  interface  system.  The 
following  procedural  description  is  developed  for  the  proposed  interface. 


begin 

1  Send  ACK  message  to  UPARSE_CONTROLLER  and  Wait  for  a  pathname_message 

2  REPEAT 

3  Extract  pathname  from  the  pai]iaame_message 

/*  Alternatively  a  default  pathname  can  be  assumed  for  this  version  */ 

4  Count  the  munber  of  UN_PROCESSED  PLUM_TYPE_MESSAGES  from  NLP_INTERFACE_DIR 

/*  NLU_INTERFACE_DIR  :  An  NFS  mounted  directory  in  the  GIP  environment,  usually. 
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/home/gip/GIP_LroRARY/UPARSE_WORKSPACE 


*/ 

/*  UN_PROCESSED_MESSAGE:  No  file  named  PROCESSED  exists  under  the  message 
directory,  usually.  /home/gip/GIP_LIBRARY/UPARSE_WORKSPACE/plum/msgXX  */ 
/*  PLUM_TYPE_MESSAGES  :  Messages  usually  stored  under 
/home/gip/GIP_LIBRARY/UPARSE_WORKSPACE/plum  */ 

5  If  the  number  is  0  sleep  for  30  seconds  and  go  to  step  4 

6  For  every  1-  4  PLUM_TYPE_MESSAGES  create  a  PLUM_PROCESS  on  a  remote  site 
and  assign  the  messages  to  the  processes  and  mark  the  messages  PROCESSED. 

/*  For  this  version  only  1-4  messages  are  expected  */ 

7  Check  /home/plum/TIPSTER/msg  directory  for  output  from  PLUM  and 
TRANSLATE  PLUM  output,  if  any.  and  write  to  NLP_INTERFACE_DIR 

/*  TRANSLATE:  For  each  message,  it  reads  from  a  single  output  file  in  the  PLUM 
envirorunent  (usually  /home/plum/TTPSTER/msg/msg-outX  )  and  writes  each  template 
to  a  distinct  file  in  the  GIP  environment  */ 

8  Publish  a  message  for  UP  ARSE  giving  the  pathname  of  the  results. 

/*  Alternatively  a  default  pathname  can  be  assumed  for  this  version  */ 

9  UNTIL  (halt  flag  set) 
end. 

6.  ESTIMATION  OF  SPEED-UP 


GIP  is  usually  executed  in  a  distributed  environment  where  several  computers  are 
networked  together  for  sharing  resources.  The  model  of  computation  for  this 
environment  is  known  as  the  client-server  model.  The  architecture  is  generally  known 
as  MIMD  (Multiple  Instruction  Multiple  Data  stream).  In  order  to  take  full  advantage 
of  available  resources  in  the  GIP  environment  its  modules  are  designed  for  distributed 
execution.  Since  data  extraction  from  free  text  needs  substantial  computing  power, 
distributed  execution  of  the  NLU  module  is  highly  desirable.  The  most  important 
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potential  benefit  from  distributed  execution  of  GIF  is  speed-up.  Unless  the  execution  of 
UPARSE  with  its  associated  NLU  modules  is  speeded-up  the  subsequent  processes 
such  as  SELECT  will  starve  under  certain  input  conditions.  Under  the  proposed 
interface  PLUM  will  be  executed  in  a  distributed  manner  in  order  to  minimize  the 
starvation  of  the  subsequent  processes.  The  exact  amount  of  speed-up  depends  on  the 
speed  of  the  involved  processors  and  various  overhead  costs.  The  amount  of  speed-up 
is  usually  sublinear  due  to  overhead  costs.  The  types  of  expected  overheads  are: 

(1)  Process  Creation  Overhead:  Extra  time  required  for  creating  additional  processes. 

(2)  Interprocess  Communication  Overhead:  Extra  time  required  for  communication 
between  processes. 

(3)  Synchronization  Overhead:  Extra  time  required  for  waiting  for  certain  events. 

PLUM  takes  about  42  seconds  on  Sun  Sparc  Station-2  for  processing  the  message 
given  in  Figure-2.  For  four  messages  of  this  type  it  took  about  four  times  longer.  If 
PLUM  executes  these  4  messages  on  four  machines  how  fast  can  it  finish?  The 
current  version  of  PLUM  is  not  yet  ready  for  distributed  execution.  Therefore  it  is  not 
possible  to  measure  the  exact  speed-up  of  distributed  execution  of  PLUM.  However, 
modeling  PLUM’s  behavior  in  some  programs  that  ran  on  4  Sun  workstations  it  is 
found  that  PLUM  is  expected  to  process  this  type  of  messages  about  3  times  faster 
utilizing  four  machines. 
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THE  GLOHRL  PULSE: 

POSSIBLE  RPPLICBTIQNSTO  SPERKER  IDENTIFICRTION 

Dr.  Beth  L.  ’  osieiuicz 
Rssistant  Professor 
Experimental  Psychology 
Colorado  State  Uniuersity 


Abstract 

One  of  the  major  current  thrusts  of  the  Speech  Processing  Lab  at  Rome  Labs 
is  speaker  identificiation.  There  is  euidence  that  the  uocal  fold  uibrations  (the 
glottal  pulse)  may  prouide  unique  information  capable  of  differentiating  betuieen 
speakers.  If  this  is  true,  glottal  pulse  information  could  be  used  to  substantially 
improue  current  speaker  identification  algorithms.  Rn  in-depth  literature  reuieui  of 
uihat  is  known  about  the  glottal  pulse  waueform  cooperates  this  euidence. 
Recommendations  are  made  about  appropriate  equipment  to  use  in  future 
inuestigations  of  glotto  ;iuise  waueforms  in  speaker  identification. 
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THE  GLOnBL  PULSE: 

POSSIBLE  RPPLiCRTIONSTO  SPERKER  IDENTIFICRTION 

Or.  Beth  L.  Losieiuicz 

Introduction 

Uoiced  speech  can  be  characterized  as  resuiting  from  a  glottal  Luaue  (source) 
passed  through  the  uocal  tract  (filter).  On  the  luhole,  speaker  identification 
algorithms  haue  not  attempted  to  analyze  the  glottal  uiaue  'source'  separately  from 
the  ouerall  speech  signal  in  which  the  effects  of  'source'  and  'filter'  are 
compounded.  Briesman  et.  al.,  (1987)  suggest  that  such  a  diuision  of  the  speech 
uiaueform  into  subcomponents  with  unique  physiological  origins,  (i.e.  glottal  'source' 
and  uocal  tract  'filter')  could  prouide  a  larger  set  of  descriptors  for  use  in  speaker 
identification  tasks.  Similar  suggestions  haue  been  made  by  other  researchers, 

(e.g.,  Miller,  1959;  Sondhi,  1975;  Monsen  et.  ai.,  1977;  Childers  and  Lee,  1991),  but 
few  such  attempts  are  reported  in  the  literature  (e.g.,  Curtis,  1974;  UJolf,  1972; 
Carrell,  1984). 

Euidence  that  the  glottal  pulse  may  be  an  essential  component  in  human 
perception  of  speaker  differences  comes  from  work  by  Holmes  (1973)  with  speech 
synthesis.  In  an  attempt  to  model  a  giuen  speaker  as  closely  as  possible  with 
synthesized  speech,  he  found  that  if  the  encitation  source  was  an  estimate  of  that 
speaker's  (inuerse-fiiter  deriued)  glottal  pulse,  the  resulting  synthesized  speech 
was  judged  as  more  natural  than  if  the  encitation  source  was  an  idealized  pulse 
shape  (e.g.,  combinations  of  cosine  segments).  Klatt  (1987)  also  reports  that  glottal 
pulse  characteristics  are  of  "considerabie  importance  to  speaker  identification," 
and  models  glottal  uariability  in  open  quotient,  spectral  tilt  and  breathiness  noise  in 
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his  Klatttalic  speech  synthesizer.  Vegnanarayan,  Naik  &  Childers  (1984)  also  report 
that  glottal  pulse  shape  has  a  large  role  in  quality  of  synthetic  speech,  more  so  than 
either  pitch  period  or  choice  of  uocal  tract  modelling  parameters. 

The  Glottis 

The  glottis  is  the  opening  between  the  uocal  folds  through  which  pulmonary 

air  pressure  passes  during  both  breathing  and  speech.  These  folds  consist  of  muscle 

and  mucous  membrane  tissue  which  stretch  between  the  thyroid  cartilage  and  the 

arytenoids.  During  breathing  the  glottis  is  fully  open,  allowing  air  to  pass  through 

freely.  In  uoiced  speech,  howeuer,  the  glottis  is  partially  or  completely  closed. 

Subglottal  air  pressure  causes  periodic  uibration  of  the  uocal  folds,  resulting  in  the 

strong  periodic  eucitation  that  driues  uoiced  speech.  The  speaker,  by  controlling  the 

shape  and  tension  of  the  uocal  folds,  can,  within  limits,  control  this  eucitation 

source.  The  periodic  pulse  generated  in  the  glottis  passes  through  the  pharyngeal 

cauity  to  the  mouth,  (and,  if  the  nasal  passages  are  open,  into  the  nasal  cauity  as 

well).  By  uarying  the  mouth  cauity  and  the  uarious  articulators  (tongue,  teeth,  soft 

palate,  etc.)  the  speaker  shapes  the  glottal  waue,  augmenting  some  frequencies  and 

attenuating  others,  yielding  the  complex  waueforms  of  speech. 

The  Glottal  Pulse 

The  moment  of  glottal  closure  coincides  with  the  peak  excitation  of  the  uocal 
tract  (Miller,  1959),  although  excitation  can  occur  without  complete  glottal  closure 
(Holmes,  1962;  cited  in  Rosenberg,  1970).  Dampening  of  the  excitation  occurs 
during  the  succeeding  closed-glottis  phase,  as  a  result  of  energy  absorption  by  the 
uocal  tract.  Secondary  excitation  occurs  at  the  point  of  glottal  re-opening  (Miller, 
1959;  Childers,  et.  ai.,  1991). 

The  source-filter  model  of  the  uocal  apparatus  is  often  simplified  to  exclude 
results  of  interaction  between  the  source  and  the  filter,  and  the  idealized  glottal 
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pulse  is  represented  as  a  smoothly  uarying  signal  encept  for  discontinuous  slopes  at 
the  endpoints.  This  is  not  completely  accurate.  In  fact  Bickley  and  Steuens  (1986) 
report  significant  indiuidual  differences  In  the  effect  of  uocal  tract  loading  on  the 
glottal  luaueform.  Rothenberg  (1985)  reports  that  -The  lorn  frequency  inertance  of 
the  uocal  tract . . .  can  cause  a  skewing  of  the  glottal  flow  pulse  to  the  right. 

Monsen  &  Engebretson  (1977)  reports  dips  in  the  waueform  at  the  fourth,  fifth  and 
sixth  harmonics,  presumably  resulting  from  the  interaction  of  the  glottal  airflow 
with  the  acoustic  loading  of  the  uocal  tract  (Milenkouic,  1986).  Klatt  and  Klatt 
(1990)  report  “ripple  components"  at  the  FI  frequency  due  to  pharyngeal  pressure, 
and  Fant  (1985)  reports  “superposition  of  formant  ripple  from  preulous  glottal 
periods,"  which  uaries  with  closed  phase  duration,  being  most  prominent  after 
short  closed  phases.  Sub-glottal  and  supra-glottal  resonances  also  influence  glottal 
pulse  shape  (Scordilis  &  Gowdy.  1990;  Fant,  1985),  and  glottal  pulse  shape  can 
reflect  out-of-phase  uibratlon  of  the  top  and  bottom  surfaces  of  the  uocal  folds 

(Klatt,  1987). 

sniirces  of  n»  nttal  Pulse  UariabUim 

The  glottal  pulse  can  uary  on  a  number  of  different  parameters:  pulse  width, 
pulse  skewedness  (ratio  of  duration  of  open  and  closed  phases),  closure  abruptness, 
symmetry  of  pulse,  shape  of  pulse  peak,  rms  intensity,  pulse  phase,  source 
spectrum  slope,  turbulent  noise  component,  etc.  (Rosenberg,  1970;  Miller,  1959; 
Childers,  et.  al.,  1991;  lUolf,  1972;  Klatt  &  Klatt,  1990;  Monsen  &  Engebretson,  1977; 

Scordilis  &  Gowdy,  1990). 

This  uariance  is  caused  by  complex  interactions  of  physiological  and  motor 
control  determinants,  including  shape  and  size  of  the  glottal  opening,  subglottal 
resonances,  subglottal  air  pressure,  uocal  fold  elasticity  and  thickness,  phase 
relations  of  upper  and  lower  portions  of  uocal  fold,  and  certain  supra-glottal 
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characteristics  (e.g.,  of  the  trachea  and  supra-glottal  uocai  tract).  RIthough 
physiological  determinants  are  relatiuely  static  within  a  giuen  speaker  (e.g.,  sub- 
giottal  resonance  characteristics,  uocai  fold  elasticity,  etc.),  most  motor  control 
parameters  haue  an  effectiue  range  of  intra-speaker  uariance.  The  speaker  can, 
and  does,  control  the  uarious  motor  parameters  during  speech,  both  indiuidually  and 
in  compleK  interactions.  Intonation,  syllable  stress,  phonation  type  (e.g.,  creaky 
uoice,  breathy  uoice,  falsetto,  and  normal  uoice),  uowel  type  and  amplitude 
intensity  ail  haue  glottal,  and  hence  glottal  waueform  implications. 

If  glottal  pulse  characteristics  are  to  be  usable  as  cues  to  speaker 
recognition,  those  aspects  of  the  glottal  pulse  which  haue  greater  inter-  than  intra¬ 
speaker  uariability  need  to  be  identified.  Only  a  handful  of  studies  haue  attempted 
to  distinguish  between  characteristics  that  are  inter-  uersus  intra-  speaker 
uariable,  mostly  focusing  on  male/female  differences.  R  brief  reuiew  of  known 
sources  of  inter-  and  intra-speaker  uariance  in  glottal  pulse  characteristics  follows. 
inter-SoeakerUariabilitu:  Male/Female  Differences 

RIthough  it  has  long  been  known  that  there  are  mean  F0  differences  between 
male  and  female  speakers,  only  a  handful  of  studies  haue  inuestigated  other  glottal 
pulse  characteristics  of  male  uersus  female  speech.  Monsen  and  Engebretson 
(1977)  found  seueral  such  glottal  pulse  attributes.  First,  the  shape  of  the  glottal 
pulse  for  females  was,  on  the  whole,  more  symmetrical  than  the  male  pulse,  with 
the  male  pulse  often  euincing  a  pronounced  'hump'  in  the  opening  portion  of  the 
waue.  They  hypothesized  that  this  results  from  a  "slight  out-of-phase  mouement 
of  the  upper  and  lower  uocai  fold."  Rs  the  FB  of  a  male  speaker  rises,  this  'hump' 
becomes  less  prominent,  and  the  pulse  becomes  more  symmetrical.  Intensity  also 
affects  pulse  symmetry.  Soft  (i.e.,  not  loud)  male  articulations  also  tend  to  lose  the 
characteristic  "hump." 
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Not  surprisingly,  Monsen  and  Engebretson  also  found  that  auerage  rms 
Intensity  is  lower  for  female  than  male  unices,  with  an  auerage  drop  of  -6  dB.  The 
most  striking  difference  they  found  between  male  and  female  glottal  pulses, 
howeuer,  was  a  difference  in  harmonic  relations,  namely  that  the  amplitude 
attenuation  slope  at  high  frequencies  was  steeper  for  females  than  for  males. 

UJhile  the  auerage  fall-off  rate  for  males  was  approximately  -10  dB  per  octaue,  the 
fall-off  rate  for  female  unices  aueraged  -15  dB  per  octaue.  Martony  (1965)  and 
lUolf  (1972)  also  found  significant  inter-speaker  differences  in  the  high  frequency 
slope  of  the  source  spectrum.  The  range  of  intra-sex  uariance  in  slope  in  the 
Monsen  study,  howeuer,  was  fairly  large,  especially  in  connection  with  intensity 
differences.  Steeper  declines  in  the  spectrum  slope  occurred  during  soft  (as 
opposed  to  normal  or  loud)  articulation  (intra-speaker). 

The  results  of  Monsen  and  Engebretson  also  hint  at  the  possibility  that  the 
mean  F0  change  for  rising  and  for  falling  intonations  may  be  different  for  males  and 
females. 

Klatt  and  Klatt  1990  found  male/female  differences  in  the  propensity  to 
different  phonation  types.  RIthough  there  was  large  inter-speaker  uariabiiity 
within  sex,  female  speech  was  more  apt  to  be  characterized  by  breathy  uoice,  while 
creaky  uoice  was  more  likely  to  occur  in  male  speech.  Instances  of  breathy  uoice 
also  increased  for  unstressed  syllables,  utterance-final  syllables,  and  at  the  margin 
of  uoiceless  consonants. 

Intra-Soeaker  Uariabiiity 

Higher  ieuei  linguistic  factors  can  condition  changes  in  the  glottal  waueform 
within  speakers.  Sentence-  and  discourse-leuei  intonation  contours  depend  on  F0 
changes.  Interrogatiue  prosody  is  characterized  by  a  sharp  final-syllable  rise  in  F0, 
while  the  final  syllable  of  a  declaratiue  utterance,  as  well  as  stressed  syllables,  are 
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characterized  by  falling  FO.  Syllable  and  word  stress  itterr'?,  also  affect  the 
glottal  uiaue  shape. 

Speakers  can  also  uary  glottal  characteristics  to  achieue  uariations  in  uocal 
quality.  Some  languages  eupioit  phonation  differences  as  contrast  iue  linguistic 
parameters  (Klatt  &  Klatt  199Q).  but  this  is  not  the  case  in  English.  Thus  in  English 
phonation  type  is  free  to  signal  indiuidual  uoice  characteristics,  sen  of  speaker,  or 
pragmatic  information.  There  is  also  euidence  that  the  glottal  waueform  changes 
with  the  uowel  being  uocalized  (Cranen  and  Bouis,  1985;  Rnanthapadmanabha  and 
Fant,  1982;  Fant,  1985). 

Glottal  Pulse  Uariables  in  Sneaker  Identification 

In  spite  of  the  large  number  of  parameters  on  which  glottal  source 
characteristics  can  differ  within  the  utterances  of  a  single  speaker,  there  is  reason 
to  belieue  that  such  uariability  may  be  sufficiently  constrained  to  allow  inter¬ 
speaker  differences  to  be  used  in  speaker  identification. 

Miller  (1959)  claimed  that  indiuidual  speakers  tend  to  keep  pulse  shape  and 
width  constant  while  uarying  closure  period.  "This  keeps  the  frequency-energy 
distribution  essentially  constant  as  the  pitch  uaries,  thus  presumably  hauing  less 
effect  on  the  formant  ualues."  If  this  is  the  case,  pulse  shape  and  width  might  be 
candidate  parameters  for  speaker  identification  algorithms.  Milenkouic  (1993),  in 
contrast,  emphasizes  the  uariability  of  glottal  open  time  within  speakers. 

The  degree  of  FB  rise  due  to  linguistic  stress,  on  the  other  hand,  may  not 
prouide  good  inter-speaker  differentiation.  lUolf  (1972)  claimed  that  in  his  data  set 
(21  adult  male  speakers)  the  range  of  intra-speaker  uariation  on  this  parameter 
was  so  great  as  to  make  it  an  unlikely  candidate  for  distinguishing  between 
speakers.  He  did  predict  that  auerage  F0,  on  the  other  hand,  would  be  a  good 
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differentiator  between  speakers,  and  Rtal  (1972)  ciaims  that  pitch  contour  is  a 
usabie  parameter  in  speaker  identification. 

Same  Effect.  Different  Cause:  Possibie  Sneaker  Identification  Parameters 

There  are  seuerai  known  instances  in  which  a  singie  perceptuai  resuit  can  be 
caused  by  more  than  one  type  of  giottai  waueform.  Monsen  and  Engebretson 
(1977),  for  enampie,  found  that  creaky  uoice  phonation  couid  resuit  from  two 
different  types  of  glottal  actiuity.  One  type  was  characterized  by  slack  uocai  folds, 
low  subgiottal  air  pressure  and  wide,  random,  uariations  in  pulse  period  (for  one 
subject,  three  adjacent  pulses  had  periods  of  33  ms,  1 1  ms,  and  18  ms).  The  other 
type  of  creaky  uoice  was  produced  by  double  pulses  of  unequal  duration.  Hamlet 
(1971)  also  reports  more  than  one  type  of  creaky  uoice.  It  is  possible  that 
characteristic  use  of  creaky  uoice,  as  well  as  type  of  creaky  uoice  used  (perhaps 
interacting  with  other  factors,  such  as  linguistic  or  intonation  content)  couid  be  a 
usable  parameter  for  uoice  recognition,  although  one  that  would  presumably  be 
susceptible  to  intentional  deception  (i.e.,  uoice  disguise). 

Rnother  case  of  a  similar  perceptual  effect  arising  from  different  giottai 
patterns  arises  in  FO  changes.  Monsen  and  Engebretson  (1977)  found  that  FB  change 
was  either  characterized  by  i )  an  enuelope  shift  in  frequency  and  amplitude,  with  a 
concomitant  maintenance  of  harmonic  relations,  or  2)  a  change  in  harmonic 
relations  with  concomitant  stability  of  the  frequency  and  amplitude  enuelope.  They 
found  that  when  the  F0  rise  was  associated  with  interrogatiue  pitch  contour,  the 
spectral  enuelope  retained  its  shape  (as  the  harmonic  relations  become  steeper) 
whereas  the  (smaller)  rise  in  F0  associated  with  syllable  stress  was  associated  with 
a  shift  in  amplitude  and  frequency,  maintaining  harmonic  relations.  The  fall  in  F0 
associated  with  a  declaratiue  pitch  contour  was  accompanied  by  the  same  giottai 
waue  changes  as  the  F0  rise  due  to  syllable  stress.  They  concluded  that  different 
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types  of  glottal  changes  characterized  different  types  of  FB  change.  It  might  be 
worthwhile  to  Inuestigate  whether  this  distinction  holds  across  speakers,  or 
whether  speakers  show  a  preference  for  one  type  of  FB  shift  mechanism  ouer  the 
other.  If  they  do,  this  might  be  exploited  in  speaker  identification  algorithms. 
Measuring  fEstlmatinol  The  Glottal  Pulse 

Measuring  (or,  more  commonly,  estimating)  the  glottal  pulse  is  not  triuiai. 
Direct  measures,  usually  rather  inuasiue,  include  high  speed  photography  (Childers, 
Mott  and  Moore,  I98B),  transillumination,  electromyography,  ultrasound  (Hamlet, 
1971),  or  direct  measure  of  the  glottal  air  flow.  Simulations  haue  been  carried  out 
with  eucised  human  and  canine  larynges,  and  with  computer  models  of  the  uocal 
fold  oscillation.  Sondhi  (1975)  and  Monsen  &  Engebretson  (1977),  directly  measured 
the  glottal  pulse  using  a  reflectionless  metal  tube  as  a  pseudo-infinite  eutension  of 
the  uocal  tract. 

Indirect  measurements  are  accomplished  through  inuerse  filtering  techniques 
that  attempt  to  factor  out  the  uocal  tract  contribution,  leauing  the  glottal  pulse  as  a 
residual  (Briesman,  et.  al.,  1987;  Matausek  &  Batalou,  198B;  Milenkouic,  1987; 
Mllenkouic,  1993;  Rosenberg,  1971;  RIku  &  Laine,  1989;  Isaksson  0  Millnert,  1989; 
Schoentgen,  199B;  Strube,  1974;  Rnanthapadmanabha  0  Fant,  1982).  On  the  whole 
these  indirect  estimates  haue  correlated  well  with  the  results  of  more  direct 
measures,  aithough  careful  attention  to  filter  adjustment  is  crucial.  Euen  with  an 
optimal  filter,  there  is  a  large  potential  for  confusion  of  glottal  waue  and  formant 
oscillations  for  uowels  with  low  F1,^  and  nasalized  uoweis  are  not  amenable  to  this 
type  of  inuerse  filtering  (which  only  models  the  effect  of  the  uocal  tract). 
Milenkouic  (1986)  reports  filter-induced  artifactual  mimicking  of  glottal  non- 

^But  see  Cranen  and  Bouis  (1985)  for  an  alternate  enplanation  of  this  difficulty.  Their 
suggestion  that  low  uowels  haue  a  different  glottal  waueform  might  enplain  this  difficulty  in 
obtaining  a  good'  glottal  pulse  for  low  FI  uowels  by  inuerse  filtering. 
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closure.  Briesman,  et.  al.  (1987)  suggest  testing  the  inuerse  filter  on  a  uiaueform 
with  a  known  (synthesized)  glottal  waueform  before  eKperimentation. 

The  set  of  glottal  waueform  measurements  proposed  by  Fant  (1985)  are 
illustrated  in  Figure  I. 

Recommended  Methods  for  Further  Research 

Seueral  possible  auenues  for  inuestigating  the  usefulness  of  glottal  pulse 

information  in  speaker  identification  were  considered:  Electroglottograph  (EGG) 

signals;  a  software  inuerse  filter;  a  Sondhi  tube;  and  an  analog  (hardware)  inuerse 

filter  with  a  Rothenberg  Mask  (Rothenberg,  1977). 

The  Rir  Force  Rcademy  has  prouided  a  database  to  the  Speech  Processing  Lab 

which  contains  parallel  speech  and  electroglottograph  (EGG)  signals  from  a  uariety 

of  speakers.  RIthough  the  two  signals  were  intended  to  be  time-aligned,  they  are 

not  in  fact  so  aligned,  being  offset  by  approKimately  25  msec.  Since  the  giottis-to- 

microphone  trauel  time  should  only  be  about  1-2  msec,  this  asynchrony  was 

presumably  caused  by  transmitter  characteristics.  RIthough  the  signals  could  be 

time-aligned  retrospectiuely,  this  is  a  less  than  optimal  solution. 

This  EGG  data  was  not  chosen  as  the  best  method  for  inuestigating  the  glottal 

pulse  for  a  slightly  different  reason,  howeuer.  The  EGG  is  a  measure  of  uocal  fold 

contact  area,  and  as  such  fails  to  giue  any  detailed  information  about  airflow  during 

the  open  period  of  the  glottal  pulse.  This  is  reflected  in  flattened  peaks  in  the 

signal.  Since  inuerse  filtering  offers  more  detailed  information,  and  since  it  is  more 

c 

likely  to  be  applicable  to  signals  obtained  from  existing  ommunication  channels,  it 

A 

was  deemed  to  be  a  preferable  technique  for  present  purposes. 

R  Sondhi  Tube  (Sondhi,  1975)  was  also  rejected  as  a  method  for  obtaining 
glottal  pulses,  primarily  because  it  is  not  certain  what  effect  the  tube  has  on  the 
speaker's  articulation. 
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UJe  were  unable  to  locate  a  pre-programmed  software  inuerse  filter  to 
estimate  the  glottal  pulse.  Since  the  literature  reports  that  filter  adjustment  is 
critical,  we  decided  not  to  uenture  into  the  business  of  deueloping  and  programming 
our  own  filter. 

Rn  analog,  hardware,  inuerse  filter  is  auailable  commercially  from  Glottal 

Enterprises  in  Syracuse,  New  York.  This  fiiter  was  deueioped  by  Rothenberg  (1977), 

ao<( 

and  is  marketed  with  a  uolume  uelocity  flow  mask  whiett- has  been  accepted  by  the 
scientific  community  as  a  reliable  way  to  obtain  the  airflow  to  be  inuerse  filtered. 
(Its  main  aduantage  is  that  it  obuiates  the  need  for  an  eupensiue  low-frequency 
response  microphone  needed  to  obtain  accurate  low  freqfe^cy  glottal  pulse 
information  from  a  pressure  waue.)  The  mask  and  filter  units  are  ineupensiue 
(approMimately  $2,300)  and  it  was  decided  that  it  prouided  the  best  cost/reliability 
ratio  for  the  proposed  research. 

It  should  be  noted  that  the  waueform  obtained  from  the  uolume  uelocity 
airflow  contains  more  accurate  glottal  pulse  information  than  is  auailable  from  the 
signals  generally  found  in  real-life  speaker  identification  applications.  RC 
microphones  seriously  distort  the  low  frequency  information  of  the  speech  signal, 
thus  a  pristine  glottal  source  waueform  is  not  obtainable  with  speech  signals  as 
transmitted  by  most  current  communications  equipment.  Howeuer,  it  was  decided 
that  it  would  be  preferable  to  begin  an  inuestigation  with  the  'best  case'  scenario, 
inuestigating  whether,  in  principle,  the  glottal  pulse  information  can  differentiate 
between  speakers.  Then,  if  we  find  that  It  does  differentiate,  further  work  could 
deuelop  more  direct  applications.  If  we  fail  to  find  differentiating  parameters,  we 
would  be  confident  that  we  had  made  our  best  attempt  to  discouer  any  that  might 
euist. 


17-12 


Conclusions: 


RIthough  sparse,  the  auailable  information  about  inter-speaker  differences  in 
the  glottal  pulse  uiaueform  suggests  that  it  may  be  a  useful  parameter  for  speaker 
identification  aigorithms.  It  is  recommended  that  the  airflouj  mask  and  inuerse 
fiiter  deueloped  and  marketed  by  Glottal  Enterprises  in  Syracuse,  Neuj  York  be  used 
in  future  glottal  pulse  research  originating  from  this  lab,  as  the  most  cost  effectiue 
and  accurate  way  to  obtain  the  glottal  uiaueform  for  speaker  identification 
research. 


17-13 


REFERENCES 


Rnanthapadmanabha,  T.  and  G.  Fant.  (1982).  Calculation  of  true  glottal  flow  and  its 
components.  Speech  Communication.  1.  167-184. 

Rtal.  B.  (1972).  Rutomatic  speech  recognition  based  on  pitch  contours.  Joumai  of 
the  Acoustical  Societu  of  Rmerica.  52.  1687-1697. 

RIku,  P.  0  U.  Laine.  (1989).  R  new  glottal  LPC  method  for  uoice  coding  and  inuerse 
filtering.  IEEE  International  Symposium  on  Circuits  and  Systems,  uol.  3,  1831- 
1834. 

Bickley,  C.  &  K.  Steuens.  (1986).  Effects  of  uocai-tract  constriction  on  the  glottal 
source:  EKperimental  and  modelling  studies.  Joumai  of  Phonetics.  14.  373-382. 

Briesman,  N.,  C.  Thorpe  &  R.  Bates.  (1987).  Nontactile  estimation  of  glottal 

eKcitatlon  characteristics  of  uoiced  speech.  lEE  Proceedings.  134.  807-813. 

Carrell,  T.  (1984).  Contributions  of  fundamental  frequency,  formant  spacing  and 
glottal  waueform  to  talker  identification.  Speech  Research  Lab  Progress  Report 
5,  Indiana  Uniuersity,  Bloomington. 

Childers,  D.  &  C.  Lee.  (1991).  Uocal  quality  factors:  Analysis,  synthesis  and 
perception.  Joumai  of  the  Acoustical  Society  of  America.  2394-2410. 

Childers,  D.,  J.  Mott  &  G.  Moore.  (1988).  Rutomatic  parameterization  of  uocal  cord 
motion  from  ultra  high  speed  films.  Proc.  ICRSSP.  1,  65-68. 

Cranen,  B.  and  L.  Boues.  (1985).  Rerodynamic  aspects  of  uoicing:  Glottal  pulse 
skewing  reuisited.  ICRSSP,  Proc.  of  the  IEEE  Infl  Conf.  on  RSSP,  1085-1088 

Curtis,  T.  (1974).  inter  speaker  uersus  Intra  speaker  uariability  of  glottal  pulse 
shapes.  Joumai  of  the  Acoustical  Societu  of  America.  55. 462. 

Fant,  G.  (1985).  The  uoice-source  theory  and  acoustic  modeling.  In  I.  Titze  &  R. 
Scherer,  eds.,  Uocal  Fold  Phusioloou:  Biomechanics.  Rcoustics  0  PhonatOPi 
Control.  The  Oenuer  Center  For  the  Performing  Arts,  Denuer,  Co.,  453-464. 

Geisler,  C.  D.,  &  S.  Silkes.  (1991).  Responses  of  "lower-spontaneous-rate"  auditory 
nerue  fibers  to  speech  syllables  presented  in  noise,  il:  Glottal-pulse 
periodicities.  Joumai  of  the  flcoustical  Societu  of  America.  3140-3148. 

Hamlet,  S.  (1971).  Location  of  slope  discontinuities  in  glottal  pulse  shapes  during 
uocal  fry.  Joumai  of  the  Acoustical  Societu  of  Rmerica.  50.  1561-1562. 


17-14 


Holmes,  J.  (1973).  The  influence  of  glottal  uiaueform  on  the  naturalness  of  speech 
from  a  parallel  formant  synthesizer.  IEEE  Transactions  on  fludio  and 
Electroacoustics.  HU-21.  298-305. 

Isaksson,  fl.  &  M.  Millnert.  (1989).  Inuerse  glottal  filtering  using  a  parameterized 
input  model.  Slonal  Processing.  18.435-445. 

Klatt,  0.  (1987)  Reuieui  of  tent-to-speech  conuersion  for  English.  Journal  of  the 
flcoustical  Societu  of  Rmerica.  82.  747-793. 

Klatt,  D.  &  L.  Klatt.  (1990).  Rnalysis,  synthesis,  and  perception  of  uoice  quality 
uariations  among  female  and  male  talkers.  Journal  of  the  flcoustical  Society  of 
America.  fiL  820-857. 

Martony,  J.  (1965).  Studies  of  the  uoice  source.  Speech  Transmission  Lab,  Royal 
institute  of  Technology,  Stockholm,  Quarterly  Progress  Status  Report  1,4-9. 

Matausek,  M.  &  U.  Bataiou.  (1980).  R  neiu  approach  to  the  determination  of  the 
glottal  utaueform.  IEEE  Transactions  on  Acoustics.  Speech,  and  Signal  Processing. 
RSSP-28,  616-622. 

Milenkouic,  P.  (1986).  Glottal  inuerse  filtering  by  joint  estimation  of  an  RR  system 
uiith  a  linear  input  model.  IEEE  Transactions  on  Acoustics.  Speech  and  Signal 
Processing.  flS.SE.-5-4..  28-42. 

Milenkouic,  P.  (1993).  Uoice  source  model  for  continuous  control  of  pitch  period. 
Journal  of  the  flcoustical  Societu  of  America.  93.  1087-1095. 

Miller,  M.  and  M.  Sachs.  (1984).  Representation  of  uoice  pitch  in  discharge  patterns 
of  auditory-nerue  fibers.  Hearing  Research.  14.  257-279. 

Miller,  R.  (1959).  Nature  of  the  uocal  chord  uiaue.  Journal  of  the  flcoustical  Society 
of  Rmerica.  31.  667-677. 

Monsen,  R.  &  M.  Engebretson.  (1977).  Study  of  uariations  in  the  male  and  female 
glottal  ujaue.  Journal  of  the  flcoustical  Society  of  America.  62.  981-995. 

Ricart,  R.,  J.  Cupples  &  L.  Fenstermacher.  1993. 

Rosenberg,  R.  (1970).  Effect  of  glottal  pulse  shape  on  the  quality  of  natural  uouiels. 
Journal  of  the  flcoustical  Society  of  Rmerica.  49.  583-590. 

Rothenberg,  M.  1977.  Measurement  of  airfloui  in  speech.  Journal  of  Speech  and 
Hearing  Research.  20,  155-176. 

Rothenberg,  (1985).  Source-Tract  Acoustic  Interaction  in  Breathy  Uoice.  In  I.  Titze 
&  R.  Scherer,  eds.,  Uocal  Fold  Phusioloou:  Biomechanics.  Acoustics  &  Phonatoru 
Control.  The  Denuer  Center  For  the  Performing  Arts,  Denuer,  Co.,  155-165. 


17-15 


Schoentgen,  J.  (199B).  Non-linear  signal  representation  and  its  application  to  the 
modeling  of  the  glottal  maueform.  Speech  Communication,  i  1 89-20 1 . 

Scordilis,  M.  and  J.  Gouidy.  (1990).  Effects  of  the  uocal  tract  shape  on  the  spectral 
tilt  of  the  glottal  pulse  ujaueform.  Proceedings  of  the  IEEE.  Session  2E2,  86-89. 

SondhI,  M.  1975.  Measurement  of  the  glottal  maueform.  Journal  of  the  Acoustical 
Society  of  America.  57.  228-232. 

Strube,  J.  (1974).  Determination  of  the  instant  of  glottal  closure  from  the  speech 
maue.  Journal  of  the  Ocoustical  Society  of  America.  56.  1625-1629. 

lUolf,  Jared.  (1972).  Efficient  acoustic  parameters  for  speaker  recognition.  Journal 
of  the  Acoustical  Society  of  flmerica.  51.  2044-2056. 

Vegnanarayana,  B.,  J.  Naik,  &  D.  Childers.  (1984).  Uoice  Simulation:  Factors  Rffecting 
Quality  and  Naturaliness.  Tenth  International  Conference  on  Computational 
Linguistics,  Stanford,  California. 


17-16 


Losieuiicz 


FIGURE  1 :  Suggested  measurement  parameters  for  glottal  pulse 
From  Fant,  1985. 
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Abstract 

An  investigation  of  artificial  neural  networks  (ANN)  in  the  candidate  selection  of  auto 
source  update  (ASU)  program  was  conducted.  Among  many  types  of  ANNs  the  emphasis  was 
given  to  the  probabilistic  neural  network  (PNN)  architecture.  As  compared  to  other  types  of 
ANNs,  PNN  has  proved  to  be  reliable  and  superior  in  terms  of  speed  of  operation  and  simplicity 
of  adaptation  process.  A  PNN  network  was  designed  and  implemented  to  find  the  best  match  and 
its  confidence  estimate  for  a  given  message  among  several  possible  candidates.  Different 
experiments  were  conducted  on  a  set  of  messages  and  candidates  from  the  Defense  Mapping 
Agency  (DMA)  databases  to  check  the  performance  of  the  PNN.  The  results  have  been  successful 
and  promising.  Future  enhancement  of  the  proposed  PNN  and  other  ANN  methodologies  in  ASU 
program  has  also  been  presented. 
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ARTMCIAL  NEURAL  NETWORK  INVESTIGATION 
IN  AUTO  SOURCE  UPDATE  PROGRAM 

Mohamad  T.  Musavi  and  Walter  P.  Sweeney  Jr. 

1 .  INTRODUCTION 

The  objective  of  this  project  was  to  investigate  application  of  artificial  neural  networks 
(ANN)  in  support  of  the  Auto  Source  Update  (ASU)  program  which  is  currently  under 
development  by  the  Grumman  Data  Systems  Corporation,  Woodbury,  New  York.  The  ASU  is  a 
prototype  software  capability  for  maintaining  the  accuracy  and  currency  of  vectorized  geographical 
information  using  Defense  Mapping  Agency  (DMA)  data  [1]  and  other  source  material  as  input 

1 . 1  Auto  Source  Update  (ASU) 

The  accuracy  and  currency  of  the  DMA  databases  prove  to  be  very  critical  in  accurately 
identifying  different  segments  in  geographical  regions  of  interest  The  accuracy  issue  depends 
primarily  on  the  original  DMA  databases  and  the  currency  issue  concerns  rapid  and  dynamic 
changes  in  a  geographical  area  due  to  sudden  events  during  time  of  conflict.  Some  changes  occur 
so  rapidly  that  the  MCG&I  (Mapping,  Charting,  Geopositioning  and  Imagery)  database  supplied 
by  DMA  are  altered  significantly  in  a  matter  of  minutes  or  hours.  Under  these  conditions,  one  can 
not  solely  rely  on  the  original  DMA  database  for  planning  purposes.  An  updating  system  is 
required  to  incorporate  new  changes  to  be  used  with  the  original  database  in  a  time  efficient 
manner.  Dependence  of  a  large  number  of  Air  Force  systems  on  the  cartographic  information, 
signifies  the  imponance  of  an  updating  system. 

The  objective  of  the  ASU  program  [2, 3]  is  to  create  such  an  updating  environment  for  the 
vectorized  information  supplied  by  DMA.  From  an  external  input-out  point  of  view,  the  ASU 
program  can  be  viewed  as  a  black  box  that  accepts  inputs  from  two  different  sources  of 
information.  (1)  the  update  message  data  and  (2)  the  original  database.  The  message  data  is  an 
update  request  obtained  from  the  dynamic  source  materials.  The  original  database  can  be  any 
vectorized  data  from  the  DMA  databases.  The  output  of  the  ASU  is  the  appropriate  correction 
and/or  update  of  the  original  database  for  the  given  message.  The  ASU  internal  organization 
consists  of  three  main  building  blocks:  (1)  scenario  generation,  (2)  candidate  retrieval,  and  (3) 
candidate  selection.  Central  to  the  ASU  design  is  the  ASU  database  which  serves  as  a  data 
depository  for  different  operations  and  contains  data  from  different  external  sources  in  Neutral 
Update  Format  (NUF).  NUF  is  a  format  used  by  the  Grumman  Data  System  in  the  ASU  project. 
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The  scenario  generation  block  provides  a  simulation  environment  for  generation  of  update 
messages  to  be  used  in  training  and  testing  of  the  ASU  program.  It  consists  of  a  message 
generator  and  message  handler.  Both  will  work  with  data  in  Neutral  Update  Format  (NUF).  The 
candidate  retrieval  block  will  accept  update  request  as  input  and  will  produce  a  set  of  candidates  for 
the  given  message.  The  candidates  are  retrieved  from  the  ASU  database.  The  candidate  selection 
block,  which  is  the  focus  of  this  study,  will  accept  the  message  and  its  associated  candidates  and 
identify  the  most  appropriate  candidate  for  the  given  message. 

1 . 2  Candidate  Selection 

The  candidate  selection  block  of  the  ASU  program  requires  intelligence  and  adaptation 
process  to  select  the  best  possible  candidates  among  all  given  candidates.  The  objective  of  this 
research  is  to  investigate  an  optimal  artificial  neural  network  architecture  that  can  effectively  be 
utilized  in  this  task. 


2 .  PROJECT  DESCRIPTION 

The  objective  of  this  project  is  to  apply  artificial  neural  network  (ANN)  technology  in 
candidate  selection  of  the  ASU  program. 

2. 1  Message  and  Candidates 

The  candidate  selection  (CS)  module  should  accept  message  data  and  candidates'  data  as 
input  and  then  find  the  best  match  for  the  given  message  among  all  possible  candidates.  The 
message  data  is  extracted  from  dynamic  update  sources  such  as  evaluated  intelligence  products. 
Electronic  Chart  Updating  Manual  (ECHUM),  raw  imagery  (e.g.  LANDSAT  TM,  panchromatic 
SPOT,  scanned  NHAP,  infrared  RADAR,  SAR,  and  etc.),  native  cartographic  source  materials 
(c.g.,  paper  road  maps),  intelligence  databases,  and  any  other  update  source  within  the  Air  Force. 
These  update  messages  are  normally  in  varying  United  States  Message  Text  Format  (USMTF) 
types  (e.g.,  RECC^XREP,  UR,  etc.).  The  ASU  program  will  reformat  the  information  into  the 
Neutral  Update  Format  (NUF)  before  submitting  it  to  the  candidate  selection. 

A  message  data  in  NUF  is  normally  a  set  of  points,  given  in  latitude  (LAT)  and  longitude 
(LON)  degree  coordinates,  from  segments  such  as  roads,  runways,  lakes,  buildings,  and  etc.  in  a 
geographical  area  of  interest.  Each  message  consists  of  one  of  four  features:  (1)  point,  (2)  line,  (3) 
polygon,  or  (4)  area.  Each  feature  can  have  both  numerical  and  categorical  attributes  associated 
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with  it.  These  attributes  include  feature  attribute  coding  system  (FACS)  identifier,  surface  marffrial 
category  (SMC),  predominant  height  (HGT),  width  (WID),  and  etc.  For  example,  for  a  forest 
polygon  FACS  =  5C030  (i.e.  woodland),  SMC  =  12  (i.e.  trees),  and  HGT  =  15  meters  above 
ground  level  (AGL).  While  the  attributes  of  a  bridge  can  have  FACS  =  1Q040  (i.e. 
bridge/overpass/viaduct),  SMC  =  2  (i.e.  pan-metal),  HGT  =  20  meters  AGL,  and  WID  =  10 
meters.  Feature  attributes  are  normally  given  in  the  beginning  of  a  NUF  message  for  identification 
purposes.  The  appendix  at  the  end  of  this  report  gives  a  print-out  of  a  NUF  message  and  Rgure  1 
presents  the  same  message  points  plotted  as  a  curve. 

The  candidates  data  for  a  given  message  are  provided  by  the  candidate  retrieval.  The 
candidate  retrieval  extracts  appropriate  candidates  from  the  ASU  database  based  on  the  NUF 
message.  The  candidates  also  are  in  NUF  and  have  the  same  structure  as  the  message.  A  set  of 
candidates  for  the  message  of  Figure  1  is  shown  in  Figure  2.  In  this  specific  example  there  are  1 1 
different  candidates  (numbered  from  0  to  10).  The  candidate  selection  has  to  find  the  candidate  that 
best  matches  the  given  message. 
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35 


Degrees  Longitude 


Figure  2.  Eleven  different  candidates  associated  with  the  message  of  Figure  1. 

2 . 2  Probabilistic  Neural  Network  (PNN)  Architecture 

Our  experience  with  the  probabilistic  neural  network  (PNN)  [4,  5,  6,  7]  has  indicated 
certain  advantages  of  this  network  over  other  types  of  artificial  neural  networks.  At  the  lower  level 
of  architectural  design,  PNN  would  be  sufficient  to  act  as  an  independent  building  block. 
However,  for  higher  level  decision  making  and  integration  of  individual  blocks  in  the  overall 
system  design,  more  sophisticated  architecture  and  learning  paradigm  may  be  needed. 

The  PNN  which  has  its  origin  in  Specht's  earlier  work  [8]  has  been  recently  recognized  as 
a  viable  alternative  in  dealing  with  different  classification  problems.  PNN  [9,  10]  is  the 
combination  of  a  kernel  based  estimator  for  estimation  of  probability  densities  and  the  Bayes  rule 
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for  classification  decision.  For  example  in  a  two  class  (A  and  B)  problem  the  probability  densities 
are  found  by: 


(la) 


(lb) 


where  T  is  the  total  number  of  training  points,  (11.11)  is  a  local  kernel  function,  x  is  the  input  to 
the  network,  and  and  are  the  training  data  from  class  A  and  B  respectively.  Equations  (la) 

and  (lb)  measure  the  closeness  (not  necessarily  Euclidean)  of  input  x  to  the  training  data  and  then  a 
simple  comparison  of  the  two  determines  if  x  belongs  to  class  A  or  B.  Similar  strategy  can  be 
adopted  for  more  than  two  classes  of  data. 

The  training  process  in  PNN  is  that  of  finding  a  set  of  appropriate  kernels  <1)^^  (11.11)  and 

there  is  no  weight  adaptation  process.  To  simplify  the  training  process,  Specht  makes  the 
following  three  assumptions: 

i)  All  kernels  are  the  same.  In  other  words, 

<Dn  (llx-cji)  =  <D  (llx-cjl).  for  n=l,  2  . . .  T  (2a) 


ii)  The  kernel  is  a  Gaussian  function  given  by: 

■D  (llx^:  II)  =  (2jc)  '^'"  I  S  exp  ( -  ix  -  c  f  [1]  '  (x  -  c  )  ]  .  (2b) 

^  n  n 


Where  M  is  the  dimension  of  the  input  space  and  Z  is  the  covariance  matrix. 


iii)  And  that  the  covariance  matrix  is  diagonal  and  has  equal  eigenvalues  (Z=cj^I), 


M/2  M  ■  C  II 

O  (llx-c^ll)  =  (27t)  a  exp  [ - ] 


(2c) 


Using  the  above  assumptions  the  problem  of  training  PNN  is  simply  reduced  to  selecting 
parameter  a  that  is  referred  to  as  the  smoothing  parameter.  This  parameter  is  normally  found  by 
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trial  and  error  or  the  nearest  neighbor  criterion.  Musavi  et  al.  [4]  have  recently  presented  a 
technique  for  optimal  selection  of  all  covariance  matrices.  In  our  future  studies,  this  technique  will 
be  utilized  to  enhance  the  accuracy  of  PNN  network. 

The  significant  advantages  of  PNN  classifier  are  its  speed  and  training  process.  First,  the 
training  process  is  one-pass  and  there  is  no  need  for  weight  adaptation,  hence  yielding  great 
processing  speed  as  compared  to  the  back  propagation  (BP),  radial  basis  function  (RBF),  and 
other  types  of  neural  networks.  Second,  the  network  generalizes  to  the  new  incoming  patterns 
without  having  to  repeat  the  training  process.  These  characteristics  are  ideal  for  real  time 
applications  [11,  12,  13].  The  architecture  of  PNN  is  very  simple,  thus,  software/hardware 
implementation  of  the  network  on  conventional  or  parallel  computers  is  easy.  The  processing 
sp)eed  of  PNN  is  achieved  at  the  expense  of  increased  memory  size.  This  is  due  to  the  fact  that  the 
training  data  set  has  to  be  available  for  classification  of  every  incoming  datum.  Specht  in  defense 
of  his  networks  argues  justifiably  that  memory  is  abundant  and  affordable. 

2. 3  PNN  in  Candidate  Selection  Module 

Accurate  selection  of  a  candidate,  among  many  candidates,  to  best  match  a  given  message 
relies  on  many  attributes.  Some  of  these  attributes  are  proximity,  orientation,  shape,  FACS  ID, 
width,  hight  and  etc,  A  robust  system  should  take  advantage  of  all  available  attributes  to  make  the 
best  possible  decision. 

To  apply  artificial  neural  networks  in  the  design  of  candidate  selection,  PNN  will  be  used 
as  an  individual  building  block.  One  PNN  will  be  designed  for  each  attribute.  The  following  two 
sections  will  discuss  structures  of  proximity  PNN  (PPNN)  and  orientation  PNN  (OPNN).  Similar 
structures  can  be  designed  for  shape  PNN  and  other  attributes. 

2.3.1  Proximity  PNN  (PPNN) 

For  the  proximity  attribute,  a  network  like  that  of  Figure  3  is  constructed.  In  this  network, 
the  input  vector  is  two  dimensional  consisting  of  latitude  (LAT)  and  longitude  (LON)  coordinates. 
The  output  is  a  scalar  giving  a  numerical  value  presenting  the  closeness  of  the  input  point  to  the 
message  under  consideration.  The  points  of  the  message  (training  data)  are  set  to  be  the  centers  of 
the  Gaussian  functions.  Therefore,  the  number  of  nodes  in  the  network  equals  the  number  of 
points  in  the  message.  In  case  of  excessive  message  points,  it  is  possible  to  use  some  kind  of 
clustering  algorithm  without  significantly  affecting  the  generalization  ability  of  the  network.  The 
smoothing  parameter,  a,  can  be  either  entered  by  the  operator  using  trial  and  error,  or  found 
automatically  by  the  nearest  neighbor  criteria. 
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The  following  procedure  is  followed  to  find  a  measure  of  confidence  on  the  closeness  of 
each  candidate  to  a  given  message.  A  block  diagram  of  the  procedure  has  also  been  given  in 
Figure  4. 

B:oggdure: 

Step  1.  Set  up  the  PNN  network  for  the  given  message. 

Step  2.  Select  a  candidate  for  the  given  message. 

Step  3.  Input  each  point  of  the  candidate  to  the  PNN  and  calculate  the  output. 

Step  4.  Add  the  output  to  the  previous  output. 

Step  5.  Repeat  Steps  3  and  4  until  all  points  of  the  selected  candidate  are  covered.  The  total 
summation  value  at  Step  4  is  a  measure  of  confidence  on  the  closeness  of  the  selected 
candidate  to  the  given  message. 

Step  6.  Repeat  Steps  2  though  5  until  all  candidates  of  the  selected  message  are  covered. 

Step  7.  Perform  a  normalization  on  the  total  summations  of  Step  4  for  different  candidates. 
Comparison  of  these  normalized  values  give  a  measure  of  confidence  on  each  of  the 
candidates. 

Step  8.  Repeat  Steps  1  through  7  for  a  different  message. 


c 


1’ 


y  =  f(x) 


Figure  3.  The  PNN  architecture  used  in  the  candidate  selection  block. 
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Figure  4.  A  block  diagram  of  the  confidence  generator  for  each  given  candidate. 

2.3.2  Orientation  PNN  (OPNN) 

A  PNN  similar  to  the  one  given  in  Figure  3  can  also  be  used  for  orientation  attribute 
(OPNN).  Note  that  in  this  case  the  input  to  the  network  is  one  dimensional  representing  the 
orientation  of  the  input  candidate  object  The  centers  of  the  Gaussian  nodes  are  the  orientations  of 
different  segments  constructing  the  message  object.  The  output  of  the  OPNN  is  a  measure  of 
similarity  between  the  orientations  of  the  message  and  candidate  objects. 

2.3.3  Candidate  Selection  Architecture 

The  overall  intelligent  system  for  candidate  selection  module  is  built  by  the  integration  of 
PNN  blocks  for  different  attributes.  Figure  5  presents  a  block  diagram  of  the  overall  candidate 
selection  system  design. 
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Figure  5.  The  overall  system  design  for  candidate  selection  module. 
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3. 


TEST  RESULTS 


A  set  of  messages  and  their  associated  candidates  were  obtained  from  Grumman  Data 
System  for  testing  the  methodology  presented  in  Section  2.  The  message  file  consisted  of  19 
different  messages  and  the  candidate  file  consisted  of  76  different  candidates  for  the  19  messages. 
The  19  messages  and  76  candidates  are  shown  in  Figure  6,  the  messages  are  in  solid  and  the 
candidates  are  in  dotted  lines.  Since  all  the  messages  and  candidates  have  been  plotted  on  the  same 
graph  the  beginning  and  end  of  a  message  and  its  corresponding  candidates  are  not  clearly 
identified.  A  message  with  its  corresponding  candidates  are  shown  in  Figures  1  and  2 
respectively.  Another  message  with  its  corresponding  candidates  are  shown  in  Figure  7. 

Table  1  presents  the  results  of  our  experiments  with  the  given  messages  and  candidates 
using  only  proximity  PNN.  The  first  column  gives  the  message  number,  the  second  column  gives 
the  associated  candidates  for  the  message,  and  the  last  column  presents  the  confidence  of  the  PNN 
in  selecting  each  of  the  candidates. 


Figure  6.  Nineteen  messages  and  their  corresponding  candidates. 
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All  programs  for  this  project  were  written  in  C  on  a  SUN  SPARCstation  IPC.  It  should  be 
mentioned  that  for  all  the  19  messages  and  76  candidates  the  program  generated  the  confidence 
measures  of  Table  1  in  under  16  seconds.  The  smoothing  parameter  for  the  PNN  network  was 
selected  to  be  0.002. 


Degrees  Longitude 


Figure  7 .  A  messages  and  its  corresponding  candidates. 
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Message 

1  ■■II  — 

1  Cajididate 

I  Confidence  Level 

2 

46  % 

1 

29  % 

8 

3 

25  % 

0 

0  % 

2 

71  % 

9 

1 

29  % 

0 

0  % 

1 

72  % 

10 

2 

28  % 

0 

0  % 

11 

No  Candidates 

Available 

12 

No  Candidates 

Available 

13 

No  Candidates 

Available 

0 

75  % 

14 

1 

25  % 

2 

76  % 

15 

1 

16  % 

0 

8  % 

1 

86  % 

16 

2 

14% 

0 

0  % 

3 

92  % 

4 

7  % 

1 

0  % 

17 

2 

0  % 

5 

0  % 

6 

0  % 

0 

0  % 

18 

0 

100  % 

0 

80  % 

19 

1 

20  % 

Table  1-b.  The  results  of  the  PPNN  candidate  selection  on  the  test  messages. 


IV.  CONCLUSION  and  FUTURE  WORK 


5.1  Conclusion 

In  this  study,  the  probabilistic  neural  network  (PNN)  architecture  was  successfully  applied 
to  the  candidate  selection  pan  of  the  ASU  program.  At  the  present  time,  the  candidate  selection 
results  are  solely  based  on  the  proximity  criteria.  However,  the  proposed  architecture  should  be 
tested  with  more  test  cases  and  other  attributes  should  be  clued  in  for  system  robustness. 

5.2  Future  Work 

To  enhance  the  candidate  selection  program  further  investigation  needs  to  be  conducted. 
The  following  subject  areas  can  be  considered. 

•  Inclusion  of  Feature  Attributes 

•  Shape  Description 

•  Hardware  Implementation 

5.2.1  Inclusion  of  Feature  Attributes 

An  important  enhancement  of  the  proposed  work  reported  here  is  the  inclusion  of  feature 
attributes.  Additional  artificial  neural  network  (ANN)  modules  can  be  designed  to  handle  these 
attributes.  Each  module  can  be  designed  to  work  internally  as  an  independent  component  and 
externally  as  an  integral  part  of  the  overall  system.  The  ANN  design  depends  of  the  complexity 
and  specifications  of  the  feature.  Radial  basis  function  (RBF)  and  probabilistic  neural  networks 
(PNN)  can  be  investigated  in  this  phase. 

5.2.2  Shape  Description 

Although  inclusion  of  attribute  information  is  important  for  more  accurate  identification  of 
the  candidates,  the  information  might  sometimes  be  unknown,  incomplete,  or  even  wrong  to  be  of 
any  use.  The  overall  system  design  should  therefore  be  able  to  handle  conflicting  scenarios  in  a 
self  organizing  manner.  One  of  the  strong  means  for  achieving  this  objective  is  object 
identification  through  shape  analysis.  Therefore  a  shape  descriptor  algorithm  needs  to  be  devised 
and  implemented.  The  algorithm  has  to  be  size  and  rotation  invariant.  The  design  of  the  shape 
descnptor  requires  a  supervised  neural  network.  However,  its  integration  in  the  overall  candidate 
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selection  design  has  to  be  seif-organized  and  based  on  an  unsupervised  network.  The  radial  basis 
function  (RBF)  and  probabilistic  neural  network  (PNhT)  can  be  utilized  for  this  purpose. 


5.2.3  Hardware  implementation 

For  real-time  application  of  the  candidate  selection  program  a  hardware  implementation  of 
the  proposed  ANN  methodology  is  necessary.  A  commercial  hardware  platform  that  has  recently 
been  designed  for  implementation  of  PNN  and  RBF  networks  can  be  utilized  for  this  investigation. 
The  result  of  our  investigation  may  lead  into  development  of  a  portable  intelligent  hardware  to  be 
used  by  the  Air  Force. 
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APPENDIX 


The  following  is  the  coordinate  information  and  partial  attribute  information  of  a  NUF  message. 

Message  #4 
SCAL  =  50000 
HAC  =  50.000000 
CONF  =  90.000000 
UTC  =  2 
FACS  =  1P030 
TYPE  =  L 

34.9637  ,-80.0499 

34.9638  ,-80.0471 

34.9634  ,-80.045 

34.9633  ,-80.0431 

34.9631  ,-80.0413 

34.963  ,-80.0398 

34.963  ,-80.0384 

34.9631  ,-80.037 

34.9631  ,-80.0356 

34.9632  ,-80.0342 

34.9636  ,-80.0326 

34.9634  ,-80.0314 

34.9634  ,-80.0304 

34.9636  ,-80.0294 

34.9639  ,-80.0282 
i.9638  ,-80.0275 

-4.9634  ,-80.0264 

34.9636  ,-80.0254 

34.9636  ,-80.0237 

34.9636  ,-80.022 

34.9633  ,-80.0202 

34.9632  ,-80.0184 
34.9631  ,-80.0163 

34.9634  ,-80.015 
34.9631  ,-80.0136 

34.963  ,-80.0067 

34.963  ,-79.9996 

34.9626  ,-79.9933 

34.9627  ,-79.9885 

34.9627  ,-79.9861 

34.9622  ,-79.9847 

34.9623  ,-79.9828 
34.9622  ,-79.9803 
34.962  ,-79.9755 

34.9624  ,-79.969 

34.9624  ,-79.9645 

34.9624  ,-79.9609 
34.9629  ,-79.9602 

34.9626  ,-79.9567 

34.9627  ,-79.9548 

34.9625  ,-79.9531 
34.9622  ,-79.9518 
34.9619  ,-79.9506 
34.962  ,-79.9502 
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IPToolkit:  An  Image  ftocessing  Environment 
for  the  X  Window  System 


Roben  R.  Snapp 
Assistant  Professor 
Department  of  Computer  Science  and 
Electrical  Engineering 
University  of  Vermont 

Abstract 

A  prototype  for  an  image  processing  subroutine  library,  tailcaed  to  the  X  Window  System,  is  presented.  This 
library,  called  the  IPToolkit.  is  designed  so  that  a  comprehensive  selection  of  image  processing,  visualization,  and 
analysis  operations  can  be  integrated  into  laiger  software  systems  with  minimal  cost  and  effort.  This  package  cur¬ 
rently  contains  three  modules:  an  input/output  interface  to  grayscale  and  24  bit  color  TIFF  images  (lOManager).  an 
image  processing  module  (IPRoutines).  and  a  visualization  module  that  interfaces  with  the  X  Wmdow  System 
(VPManager). 
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IPToolkit;  An  Image  Processing  Environment 
for  the  X  Window  System 

Robert  R.  Snapp 


Introduction 

The  IPToolkit  is  an  extendable  library  of  computer  subroutines  that  can  be  used  to  add  image  hanHIing  capabili¬ 
ties  to  a  variety  of  software  applications  with  minimal  cost  and  effon.  The  toolkit  consists  of  three  main  modules.  The 
Input-Output  Manager  allows  the  toolkit  to  read  digital  images  from  data  files  stored  in  the  widely  used  TIFF^  fcmiat. 
In  addition,  processed  unages  can  be  stored  in  TIFF  data  files.  The  Viewport  Manager  allows  an  application  to  dis¬ 
play  an  arbitrary  portion  of  a  digital  image  on  a  computer  display  that  is  running  the  X  Window  System  the 
Motif  Wmdow  Manager.  This  module  also  provides  a  number  of  tools  that  allow  the  application  to  change  the  appear¬ 
ance  of  the  image  on  the  display.  Basic  routines  are  provided  for  resize,  pan,  zoom,  and  rotation  operations.  Other 
routines  collect  information  interactively  from  the  user.  For  example,  the  getRectangle  function  allows  the  applica¬ 
tion  to  apply  an  arbitrary  function  on  a  rectangular  portimi  of  the  image  determined  by  the  user  as  the  application  is 
runmng.  In  order  to  minimize  the  effort  involved  in  adding  these  capabilities  to  a  new  or  existing  application  pro¬ 
gram.  most  of  these  operations  are  performed  as  callback  routines  and  event  handlers  that  are  automatically  placed 
under  the  control  of  the  X  Wmdows  event  manager.  Finally,  a  library  of  Image  Processing  Routines  contains  a  variety 
of  useful  operations  to  enhance,  modify,  and  extract  features  from  digital  images.  This  module  is  described  more 
completely  in  the  companion  report  “Image  Processing  Toolkit.”  by  Robert  L.  Stevenson  on  pages  20-1  through  20- 
12  of  this  volume. 

In  the  following  we  give  a  brief  overview  of  the  Input-Output  and  Viewport  Managers,  and  two  Image  Process¬ 
ing  Routines.  Mote  detailed  information  can  be  found  in  the  computer  files  that  comprise  the  toolkit.  Almost  every 
toolkit  subroutine  is  described  in  a  UNIX  manpage.  Other  information  can  be  found  in  the  source  code  itself.  We 

1.  TIFF,  which  stands  for  Tagged  Image  File  Format,  is  a  trademark  of  Aldus  Corporation. 
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would  first  like  to  emphasize,  however,  that  although  this  software  may  functicsi  more  or  less  as  described,  it  is  not 
sufficiently  robust  or  efficient  to  be  released.  The  programs  and  subroutines  developed  therefore  represent  only  a  first 
effort.  Further  efforts  will  be  needed  to  enhance  its  efficiency  and  functionality.  At  the  very  least,  the  IPToolkit.  in  its 
current  state,  demonstrates  the  viability  of  the  concept  of  a  comprehensive  image  processing  environment. 

The  Input-OutPut  Manager 

Two  toolkit  routines  allow  digital  images  to  be  read  from  a  TIFF  file  .^ad  loaded  into  an  IMAGE  data  structure 
(see  page  20-4),  and  the  reciprocal  (^rations.  These  are: 


loadTiffO 

reads  an  image  from  a  TIFF  file 

dumpTiHT) 

writes  an  image  to  a  TIFF  file 

At  present,  only  8  bit  grayscale  and  24  bit,  RGB,  color  images  are  supported.  Both  of  these  routines  call  functions 
from  the  libtiff  subroutine  library.  Libtiff  is  public  domain  software  and  can  be  obtained  by  the  ftp  program  from  the 
internet  host  sgi .  com.  The  source  code  for  these  routines  can  be  found  in  the  subdirectory  lOManager.  Future  tools 
will  include  routines  to  read  and  wnte  FEATURE  data  structures,  graphics  data,  and  training  set  data. 


The  Viewport  Manager 


This  is  the  most  complex  part  of  the  toolkit.  In  addition  to  the  raster  data,  an  application  must  know  what  portion 
of  the  image  should  be  displayed  (i.e.  the  viewport),  how  this  visible  portion  should  be  oriented  and  magnified,  where 
is  should  apptear  on  the  display,  where  the  converted  image  data  is  stared,  etc.  The  viewport  manager  takes  care  of  all 
of  these  details,  and  more.  To  do  so  requires  a  comprehensive  data  structure  which  we  call  an  iPTXPro  ject.  The 
IPTXPro  ject  has  fields  for  the  IMAGE  structure  returned  by  the  loadTiff  subroutine,  a  viewport  structure,  and  X 
image  structure,  a  pixmap  identification  number,  a  list  of  widgets  used  to  display  and  scroll  the  viewport,  and  some 
other  miscellaneous  fields.  The  first  task  then,  will  likely  be  to  convert  an  IMAGE  structure  to  an  IPTXProject. 
Instead,  we  provide  a  routine  that  reads  a  TIFF  file  directly  into  a  Project  structure: 


IPTXloadProjectO 


reads  a  TIFF  image  into  an  IPTXProject  strucmre 


Before  any  imagery  can  be  rendered  on  the  display,  it  is  necessary  to  initialize  the  colormap.  Currently  images 
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can  only  be  rendered  with  a  gray  scale. 


EPTXinitColormapO 

Initializes  the  gray  scale  colormap  and  the  X  Win- 

dow  visual 

The  highest  level  routines  create  composite  Modf  widgets  that  can  be  used  to  display  a  digital  image: 


IPTXcreateSimpleViewingShellO 

creates  a  shell  widget,  a  frame  widget,  a  simple 
menu  bar.  and  a  managed  scroll  area  widget. 

IPTXcreateProjectScrollAreaWidgetO 

creates  an  unmanaged  sctoU  area  widget  for  view¬ 
ing  an  image,  including  horizontal  and  vertical 
scroll  bars,  and  a  drawing  area. 

IPTXcreateProjectDrawingAreaWidge 

0 

creates  a  drawing  area  widget  for  viewing  an 
image. 

Normally,  an  application  needs  to  call  only  one  of  these  for  each  image  area  desired.  For  example,  to  display  the 
image  in  the  TIFF  file  pretty.tif  within  a  simple  viewing  shell,  a  program  might  insert  the  following  segments. 


#includ.e 
finclude 
^include 
# include 


<stdio.h> 

<IPTdatatypG.h> 

<IPTio.h> 

<IPTvp.h> 


Widget  topLevel,  shell; 
iPTXProject  *project; 

IPTXPro jectLink  *link; 

IPTXProjectLink  *pro jectList; 

iPTXListHeader  listHeader  =  IPTX_LlST_HEADER("examples")  ; 

pro jectList  =  IPTXallocPro jectLink ( (IPTXProjectLink  *)  NULL,  SlistHeader, 

(IPTXProjectLink  *)  NULL); 

topLevel  =  XtAppinitialize(&app,  "IPToolkit",  NULL,  0,  sargc,  argv,  NULL,  NULL,  0) • 
IPTXinitColor^lap  (topLevel)  ; 

project  =  IPTXloadProject (topLevel,  TIFF_FILE,  "pretty.tif"); 
link  =  IPTXappendProjectLink  (iPTXgetLastLink  (pro  jectList) ,  project); 
shell  =  IPTXcreateSimpleViewingShell (link) ; 


This  example  is  perhaps  more  elaborate  than  is  necessary:  Here  a  list  of  simple  viewing  shells  is  maintained  so  that 
the  application  can  manage  more  than  one  simple  viewing  shell  simultaneously.  The  routines  that  create  the  scroU 
area  and  drawing  area  widget  allow  an  application  to  invoke  the  visualization  module  at  a  slightly  lower  level.  Using 
IPTXcreateProjectScrollAreaWidgetO  for  example,  an  application  can  insert  the  same  scroll  area  widget  used  by 
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IPTXcreateSimpleViewingShellO  into  its  own  Modf  form,  frame,  or  row -coliunn. widget  Note  that  the  scroll  area 
widget  created  by  IPTXcreateProjectScrollAreaWidgetO  is  umnanaged,  i.e.  the  X  window  manager  has  not  yet 
calculated  where  the  scroll  area  will  appear  on  the  screen.  Consequently,  the  application  must  also  call  IPTXman- 
ageProjectScrollAreaO  before  the  scroll  area  widget  is  realized  and  visible  on  the  screen.  Using  IPTXcre- 
ateProjectDrawingAreaO  similarly  creates  an  image  drawing  area  cmly.  When  these  routines  are  invoked,  the 
widgets  that  they  create  are  assigned  default  callback  functions.  For  the  scroll  bars  used  by  the  scroll  Area  widget, 
these  are  IPTXhorizontalScrollBarCBO  and  IPTXverticalScroUBarCB().  These  two  routines  allow  the  user  to  pan 
the  viewport  across  the  digital  image.  An  auxiliary  routine  IPTXadjustScrollBarsO  is  used  to  modify  the  appearance 
of  the  scroll  bars  after  the  viewport  has  been  changed.  The  amount  of  overlap  (in  pixels)  that  is  maintained  during  a 
scroll  increment  or  decrement  operation  is  maintained  by  the  internal  variable  scrollOverlap.  Its  value  can  be 
changed  using  IPTXsetScrollOverlap(). 

A  number  of  callbacks  have  also  been  developed  for  the  drawing  area  widget.  (This  drawing  area  is  occasionally 
referred  to  as  the  imageArea  in  some  of  the  man  pages,  as  the  digital  image  is  rendered  on  this  widget)  The  default 
callback  function  for  the  drawing  area  created  all  of  the  routines  in  the  previous  table  is  IPTXdrawingAreaCBQ.  It 
manages  motif  expose  (XmCR_EXPOSE)  and  resize  (XmCR_RESIZE)  events.  After  an  expose  event,  the  pixmap 
that  contains  the  visible  portion  of  the  image  is  copied  to  the  display ^  After  a  resize  event,  the  Himpncinnc  of  the 
viewport  are  adjusted  to  agree  with  the  new  dimensions  of  the  drawing  area  widget  Three  special  event  handlers 
have  also  been  developed  to  acquire  geometric  information  from  the  user  interactively.  These  event  handlers  are: 


EPTXelasticRectangeEHQ 

Collects  rectangle  information  m  the  form  “<row> 
<col>  <height>  <width>” 

IPTXelasticSegmentEHO 

Collects  tine  segment  information  in  the  form 
“<rowl>  <coll>  <row2>  <col2>” 

EPTXpixelStatusEHO 

Collects  pixel  information  in  the  form  “<row> 

<col>  <intensity>” 

L  .  .  . 

After  a  ButtonPress,  Mot  ionNot  if  y,  or  ButtonRelease  event  in  the  appropriate  drawing  area,  these  event 
handlers  will,  if  appropriate,  update  a  graphical  overlay  on  the  display,  save  the  geometric  information  about  the 


1 .  Efficiency  can  be  gained  in  future  edidons  of  the  toolkit  by  setting  the  background  pixmap  of  the  drawing 
area  to  the  pixmap  of  the  project,  or  by  writing  the  image  direcdy  to  the  display. 
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overlay  in  the  current  IPTXProject  structure,  and  call  the  appropriate  auxiliary  event  handler  assigned  to  the  IFTX- 
Project.  The  routine  IPTXelastkRectangleEHO.  for  example,  writes  a  colored  rectangular  boundary  over  the  dis¬ 
play.  One  comer  of  this  rectangle  is  fixed  on  the  window  where  the  pointer  was  when  the  Butt  onPr  ess  event  was 
registered,  the  other  (q)posite)  cmier  follows  the  pointer  as  it  moves  across  the  screen  with  the  button  pressed 
(MotionNotify).  After  releasing  the  button  (ButtonRelease).  the  rectangle  is  erased  and  the  event  hanHW 
removes  itself  from  the  event  handler  list.  After  each  of  these  events  a  character  suing  is  reallocated  that  can  be  refer¬ 
enced  from  the  data  field  of  the  IPTXProject  suucture,  or  by  calling  the  routine  IPTXgetProjectData().  This  string 
contains  the  four  fields  indicated  in  the  preceding  table  that  represent  the  window  coordinate  of  the  upper  left  comer 
of  the  rectangle  and  its  height  and  width  (in  window  coordinates).  IPTXelasticSegment()  and  IPTXpixelSta- 
tusEHO  function  sirmlarly,  and  are  described  in  the  toolkit’s  man  pages. 

Normally,  the  application  will  not  call  these  event  handlers  directly.  Instead,  each  has  a  corresponding  routine 
that  adds  it  to  the  event  list,  along  with  an  application  specific  routine  that  should  parse  the  data  field  of  the  project 
and  take  appropriate  actiom 


IPTXgetRectangleQ 

Construct  and  process  a  rectangular  portion  of  the 
image  defined  by  the  user. 

IPTXgetSegmentQ 

Construct  and  process  a  line  segment  defined  by 
the  user 

IPTXgetPixelQ 

Select  and  process  a  pixel  defined  by  the  user. 

Thus,  for  example.  IPTXgetRectangleQ  can  be  used  to  allow  an  application  to  process  an  arbitrary  rectangular 
portion  of  the  image  mteractively.  This  function  is  especially  useful  for  clipping  operaticms  and  for  constructing  a 

rectangle  in  a  gr^hical  overlay.  IPTXgetSegmentQ  can  be  used  to  interactively  extract  distance  infamation  from  an 
image. 


Viewport  Geometry 

Before  descnbmg  the  viewport  manipulation  routines,  it  is  helpful  to  understand  how  the  geometry  of  the  viewport  is 
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maintained.  The  following  table  illustrates  the  three  cowdinate  systems  that  are  used  to  render  images  on  the  display. 

O  o 


h 


H 

In  the  above,  rectangle  howt  represents  the  entire  image  raster,  which  is  assumed  to  have  h  pixel  rows  and  w  pixel  col¬ 
umns.  Rectangle  abdc  represents  the  visible  portion  of  the  image,  viz.  only  the  pixels  within  this  smaller  area  will  be 
rendered  in  the  project’s  drawing  area  widget  We  usually  call  this  rectangle  the  project’s  viewport.  When  we  pan  the 
viewport,  e.g.,  by  incrementing  the  scroll  bar  widgets,  we  are  in  effect  moving  viewport  abdc,  while  preserving  its 
rectilinear  orientation.  In  addition,  the  position  of  the  viewport  is  constrained  so  that  abdc  always  lies  entirely  within 
the  larger  rectangle  HOWT.  We  call  the  latter,  the  global  viewport  In  this  representation,  we  regard  the  orientations  of 
abdc  and  HOWT  as  fixed;  ray  OH  is  always  oriented  vertically  and  ray  OW  always  horizontally.  However,  we  will 
allow  the  orientation  of  the  image  to  vary.  In  the  above  illustration,  for  example,  the  image  {abdc)  is  inrIinpH  by  an 
angle  0  with  respect  to  HOWT.  (Here  9  <  0,  as  it  falls  in  quadrant  IV).  We  will  call  0,  the  viewport  angle.  The  global 
viewport,  then  is  defined  to  be  the  smallest  rectangle  in  the  given  orientation  that  complete  y  encloses  the  project’s 
image  for  the  current  viewport  angle.  As  0  varies,  howt.  will  maintain  is  rectilinear  orientation,  but  will  shrink  and 
stretch  so  that  it  exactly  encloses  the  full  extent  of  the  image.  After  the  viewport  is  modified,  the  image  pixels  that  it 
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contains  are  mapped  into  an  Xlmage  structure  that  conforms  to  the  dimensions  of  the  current  drawing  area.  We  refer 
to  point  0  in  the  above  figure  as  the  origin  of  the  Global  Viewport  Coordinate  System,  and  point  o.  as  the  origin  of  the 
Image  Coordinate  System.  We  call  point  a  the  Viewpoa  Origin. 

Four  basic  operations  are  used  to  manipulate  the  viewpon  and  consequently  modify  the  appearance  of  the  dis¬ 
played  image.  The  translation  operation  changes  the  locadon  of  the  viewport  in  the  mannpr  described  above.  The 
rotation  operation  changes  the  viewport  angle  in  such  a  way  so  that  image  appears  to  rotate  about  the  point  at  the  cen¬ 
ter  of  the  viewpat.  The  magnification,  or  zoom  operation,  makes  changes  the  rate  at  which  the  pixels  are  sampled 
when  the  viewport  is  mapped  to  the  Xlmage  structure.  An  increase  in  the  magnification  factor,  without  a  commensu¬ 
rate  increase  in  the  dimensions  of  the  drawing  area,  will  impUcidy  reduce  the  size  of  the  viewport  Finally,  the  win¬ 
dow  resize  operation  changes  the  size  of  the  project’s  drawing  area.  A  window  resize  operation  will  implicidy  modify 
the  dimensions  of  the  viewport.  To  avoid  unwanted  interference  between  these  four  operations,  the  viewport  structure 
maintains  the  image  coordinates  of  its  center,  the  viewport  angle,  the  magnification  factor,  and  the  Himpncinnc  of  the 
project  drawmg  area.  In  addidon  to  the  previously  described  scroll  bar  callback  functions,  the  following  operations 
can  be  used  to  manipulate  the  position  of  the  viewport  without  changing  the  other  viewport  parameters:: 


IPTXmoveViewportAbsO 

Translates  the  viewport  origin  so  that  it  coincides 
with  a  specified  global  viewport  coordinate. 

IPTXmoveViewportRelO 

Moves  the  viewport  origin  relative  to  its  current 
location  in  the  global  viewport  coordinate  system 

IPTXgetViewportOriginGVCO 

Returns  the  location  of  the  viewport  origin  within 
the  global  viewport  coordinate  system. 

IPTXgetViewportOriginICO 

Returns  the  location  of  the  viewport  origin  within 
the  image  coordinate  system. 

The  routine  IPTXadjustOffsetO  will  ensure  that  the  viewport  lies  entirely  within  the  Global  Viewport  Frame: 
rectangle  HOWY  above.  Routine  IPTXgetGlobalViewportRangeQ  returns  the  dimensions  of  the  Global  Viewport 
Frame,  and  IPTXfromGVCtoICO  and  IPTXfromICtoGVCO  convert  pixel  locations  between  the  Global  Viewport 
and  Image  Coordinate  Systems. 
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The  following  routines  can  be  used  to  change  the  orientaticm  of  the  image  with  respect  to  the  viewp)ort: 


IPTXgetViewportAngleO 

Returns  the  current  viewport  angle 

IPTXrotateViewportAbsO 

Sets  the  viewport  angle 

EPTX  rotate  ViewportRelO 

Adds  or  subtracts  to  the  viewport  angle 

IPTX  rotate  ViewportForwardCBO 

Rotates  the  image  by  the  rotationincrement 
in  the  clockwise  sense 

IPTXrotateViewportBackwardCBO 

Rotates  the  image  by  the  rotationincrement 

in  the  counter-clockwise  sense 

DPTXgetRotationlncrementO 

Returns  the  rotationincrement 

IPTXsetRotationIncrementO 

Sets  the  rotationincrement 

IPTXnuUViewportAngleCBO 

Sets  the  viewport  angle  to  0 

following  routines  modify  the  magnification  factor: 

IPTXsetViewportMagnificationO 

Sets  the  viewport  magnification 

BPTXgetViewportMagnificationO 

Returns  the  viewport  magnification 

EPTXzoomlnCB  () 

Multiplies  the  magnification  by  zoomFactor 

IPTXzoomOutCBO 

Divides  the  magnification  by  zoomFactor 

IPTXnormalViewportMagnificationCBO 

Sets  the  magnification  to  unity 

IPTXsetZoomFactorO 

Sets  the  zoomFactor 

IPTXgetZoomFactorO 

Returns  the  zoomFactor 

Image  Processing  Module 

The  image  processing  module  is  described  in  detail  in  the  following  report  “Image  Processing  Toolkit”.  Here, 
however,  we  describe  two  new  image  processing  routines. 


IPCannyEdge() 

Detects  edges  ir  n  image  according  to  Canny’s 
algorithm 

IPMarrHildrethEdgeO 

Detects  edges  in  an  image  according  to  the  algo¬ 
rithm  proposed  by  Mart  and  Hildreth. 

Applications 

Three  preliminary  appUcations  were  developed  to  help  debug  portions  of  the  IPToolkit  and  also  demonstrate 
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some  of  Its  features.  The  program  ip  appUes  a  single  image  processing  operatim  to  the  image  COTtained  in  a  TIFF 
file,  and  stores  the  processed  image  in  TIFF  format  in  a  new  file.  As  this  is  batch  program,  only  the  lOManager  and  IP 
Library  are  used. 

A  second  application,  called  IPView  uses  the  viewport  manager  to  display  an  arbitrary  number  of  TIFF  images 
simultaneously.  Here,  extensive  use  is  made  of  the  routine  IPTXcreateSimpleViewingShellO. 

A  third  application,  called  iptjest  integrates  many  of  tlte  functions  of  the  IPToolkit  with  the  IE2000  imagery 
server  developed  at  Rome  Laboratory. 

Conclusion 

Although  this  version  of  the  IPToolkit.  is  preliminary,  it  demonstrates  the  feasibility  of  a  comprehensive  subrou¬ 
tine  library  for  mampulating  and  displaying  digital  images  in  the  X  windows  environment.  In  its  current  state,  the 
IPToolkit  should  provide  a  useful  starting  point  for  future  efforts. 
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IMAGE  PROCESSING  TOOLKTr 


Robert  1  Stevenson 
Assistant  Professor 
Department  of  Electrical  Engineering 
University  of  Notre  Dame 


Abstract 

For  effective  exploitation  of  digital  imagery  data  it  is  essential  to  process  the  data  using  a  variety  of  techniques. 
The  algorithms  through  which  the  data  needs  to  be  run  depends  on  the  source  of  the  data,  the  environment  in  which  it 
was  collected  and  the  goals  of  the  exploitation  task.  If  the  data  is  noisy,  hltering  is  needed.  For  better  visualization  of 
the  imagery,  enhancement  is  needed.  For  characterization  of  data,  features  needs  to  extracted.  In  order  to  segment  the 
image  into  regions  of  interest,  classifying  routines  are  needed.  Thus  there  is  great  need  for  a  collection  of  image  pro¬ 
cessing  routines  which  can  easily  and  effectively  used  on  a  variety  of  data.  This  project  entailed  the  design,  creadon 
and  documentation  of  an  image  processing  toolkit  The  basic  fram  of  the  toolkit  was  established,  basic  image  pro¬ 
cessing  routines  were  implemented  and  flexibility  was  added  so  that  additional  routines  can  be  added  as  they  are 
developed. 
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IMAGE  PROCESSING  TOOLKIT 


Robert  L.  Stevenson 

latroductim 

For  effective  exploitation  of  digital  imagery  data  it  is  essential  to  process  and  to  visualize  the  data  using  a  variety 
of  techniques.  The  algorithms  through  which  the  data  needs  to  be  run  depends  on  the  source  of  the  data,  the  envircn- 
ment  in  which  it  was  collected  and  the  goals  of  the  exploitation  task.  If  the  data  is  noisy,  filtering  is  needed.  For  better 
visualization  of  the  imagery,  enhancement  is  needed.  For  characterization  of  data,  features  needs  to  extracted.  In 
order  to  segment  the  image  into  regions  of  interest,  classifying  routines  are  needed.  For  visualization,  routines  are 
needed  to  display,  pan,  zoom,  in  addition  to  other  operations.  Thus  there  is  great  need  for  a  collection  of  imagR  pro¬ 
cessing  and  visualization  routines  which  can  easily  and  efectively  be  used  on  a  variety  of  data.  The  overall  project 
entailed  the  design,  creation  and  documentation  of  an  image  processing  toolkit.  The  toolkit  consists  of  three  essential 
components, 

1.  Image  Processing  Routines. 

2.  \^sualization  Routines,  and 

3.  Input/Output  Routines. 

The  input/output  routines  provide  the  c^ability  to  read  and  write  image  data.  They  currently  can  read  and  write 
image  data  in  the  Hff  6.0  format.  The  visualizatitm  routines  create,  maintiiin  and  process  a  set  of  X  motif  based  wid¬ 
gets  for  the  visualization  of  image  data.  Current  capabilities  include  the  display,  magnification,  rotation  and  scrolling 
of  imagery.  The  image  processing  routines  provide  basic  processing  of  the  image  data. 

This  part  of  die  project  centered  on  the  design  and  implementation  of  the  image  processing  routines.  The  basic 
form  of  the  toolkit  was  established,  basic  image  processing  routines  were  implemented  and  flexibility  was  added  so 
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that  additional  routines  can  be  added  as  they  are  developed. 


Implementation 

In  image  processing  there  are  three  basic  types  of  routines. 

1 .  routines  which  process  the  image  and  output  a  modified  image  (e.g.  filtering  routines), 

2.  routines  with  extract  some  metric  from  the  image  (e.g.  find  the  maximum  of  the  image),  and 

3.  routines  which  transform  the  image  data  in  order  to  extract  informadoQ  (e.g.  Fourier  transform  or 
an  edge  detector). 

The  input  to  each  of  these  type  of  routines  is  an  image  data  structure.  The  output  of  the  first  type  of  processing  is  also 
an  image  data  structure.  The  output  of  the  second  type  of  processing  is  a  number  (or  array  of  numbers).  The  output  of 
the  third  type  of  processing  has  the  same  dimensions  of  an  image  but  quit  often  requires  a  higher  intensity  resolution 
then  that  needed  to  represent  the  image  data.  In  order  to  support  the  different  basic  resolution  needs  of  the  data  types, 
the  following  two  structures  were  defined. 


typedef  struct  { 
int  fmt ; 

u_long  h;  /*  height  */ 

u_long  w;  /*  width  */ 

u_long  p;  /*  tplanes,  e.g.  3  for  RGB,  1  for  Grayscale  */ 

PIXEL  ***data;  /*  raster  */ 

/*  data[0] [0]  points  to  the  beginning  of  a  */ 
/*  contiguous  raster  of  h*w*p  pixels;  */ 

/*  data[k] [0]  points  to  the  beginning  of  */ 
/*  the  k-th  plane  (or  color);  */ 

/*  data[k][i)  points  to  the  first  pixel  in  */ 
/*  the  i-th  row  of  the  k-th  plane.  */ 

PALETTE  *palette;  /*  Only  used  for  palettized  color  images  */ 

}  IMAGE; 

typedef  struct  { 

u_long  h;  /*  height  */ 

u_long  w;  /*  width  */ 

u_long  dim;  /*  number  of  planes  */ 

float  ***data;  /*  data,  orgainized  as  in  IMAGE  */ 

}  FEATURE; 


The  important  differerxx  between  the  two  data  structures  is  in  the  resolution  of  the  data  variable.  In  the  FEA¬ 
TURE  structure,  data  is  a  float,  where  as  in  the  IMAGE  structure,  data  is  a  PIXEL.  PIXEL  is  currently  defined  to  be 
an  unsigned  char  (8  bits),  but  can  be  easily  changed  to  suppnt  higher  resohiticm  imagery. 
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Each  roudne  in  the  image  processing  toolkit  is  passed  an  input  IMAGE  and  any  other  necessary  parameters.  The 
image  processing  routine  allocates  the  necessary  space  for  the  output  data  structure,  processes  the  input  imagp.  and 
returns  a  pointer  to  the  output  If  the  routine  fails  (due  to  ruiming  out  of  memory  or  incorrect  parameters)  an  error 
message  is  processed  by  Error()  and  the  routine  returns  a  NULL  pointer.  For  example,  to  perform  a  5  by  5  square 
median  filtering  operation  on  the  image  Inlmage.  the  followiag  call  is  made 

IPSquareMediandnImage,  5,5); 

If  the  routine  is  successful  a  pointer  to  the  processed  image  data  is  returned,  if  unsuccessful  NULL  is  returned. 
Routines  hnolemented 

Hus  section  contains  a  list  of  the  implemented  routines,  grouped  by  the  type  of  processing  they  perfonn.  The 
routines  can  also  be  grouped  by  the  type  of  data  diat  they  return.  Most  routines  either  return  a  pointer  to  a  FEATURE 
or  IMAGE  structure.  Space  for  processed  data  is  allocated  within  the  routine.  All  of  the  Tmag«»  Handling  image 
Enhancement.  Image  Hltering,  and  Image  S^mentation  routines  allocate  and  return  a  pointer  to  an  IMAGE  struc¬ 
ture.  In  addition  the  routines  IPAddBitErrorO.  IPAddGaussianO.  IPAddlmpulsiveO.  IPAddUniforaO,  IPImag- 
eToFeatureO  and  aUocImageO  return  IMAGE  pointers.  All  of  the  Feature  Handling  (except  IPImageToFeatureO) 
and  Feature  Extraction  routines  (except  IPCoOccumenceO)  allocates  and  returns  pointers  to  FEATURE  structures. 
In  addition  allocFeaturelnitO  returns  a  FEATURE  pointer.  The  remaining  routines.  IPImageHistogramO.  IPIm- 
ageMaxO.  BPImageMinQ.  IPImageMomaitsO.  IPImag^tdOevO  and  IPCoOccurrenceO,  all  return  infonnatimi 
about  the  image  generally  in  some  multidimensicxial  floating  point  array.  See  their  respective  manual  pages  for  more 
details. 

Image  Handling  Routines; 

IP3GrayToCok)rO 
IPAddConstantO 
IPAddlmageO 
IPCopylmageO 
IPCropImageO 
IPImageFUpHorizontalO 


creates  a  color  image  by  copying  planes  from  three  grayscale  imagM 
add  a  constant  value  to  an  image 
add  two  images  together 
creates  a  copy  of  an  image 

create  smaller  image  by  cropping  a  piece  from  a  larger  image 
create  die  minxr  image 
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IPImageFIipVerticalO 

IPIinageRotate900 

IPImageScaleO 

IPImageToColoK) 

EPImageToGrayO 

DPImageToImageO 

IPMultConstantO 

IPMultlmageO 

IPSublmageO 

Image  Enhancement  Routines: 

IPCenterMeanO 

IPConstrastAdjustO 

EPGammaAdjustO 

IPHistogramEqualO 

IPIntensityMapO 

IPInvertO 

IPStretchRangeO 

IPUnSharpMaskO 

Image  Filtering  Routines: 

EPATrimmedCrMeanO 

IPATrimmedSqMeanO 

IPAverageO 

EPConvolutionO 

IPCrossMedianO 

IPGanssianSmoothO 

IPSqnareMedianO 


flip  image  around  horizontal  axis 
rotate  image  by  increments  of  90  degrees 
resizes  an  image  by  a  scale  factor 

creates  a  color  image  by  copying  three  image  planes  from  the  origi¬ 
nal 

creates  a  grayscale  image  by  selecting  one  of  the  image  planes 

maps  one  image  structure  to  another  with  a  possible  reordering  of 
planes 

multiplies  an  image  by  a  constant  value 
multiply  two  images  together 
subtraa  one  image  from  another 


adjusts  the  mean  of  the  image 
stretch  or  ccanpress  the  image  contrast 
adjust  the  image  gamma 

do  contrast  enhancement  by  histogram  equalization 

perform  a  rowing  of  the  intensity  values  in  the  image 

invert  the  intensity  values  of  an  image 

stretch  the  range  of  the  image  values 

enhance  an  image  using  an  unsharp  mask  operator 


filter  an  image  witii  a  ali^-trimmed  cross  mean 

filter  an  image  with  a  alpha-trimmed  square  mean 

filter  an  image  with  an  averager 

filter  an  image  by  convolving  image  with  kernel 

filter  an  image  with  a  cross  median 

filter  an  image  by  ctmvolving  image  with  a  Gaussian  kernel 

filter  an  image  with  a  square  median 
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Image  Statistical  Routines: 


IPAddBitError() 

add  an  impulsive  noise  signal  to  an  image 

IPAddGaussianO 

add  an  Gaussian  noise  signal  to  an  image 

IPAddlmpolaveO 

add  an  impulsive  noise  signal  to  an  innigp. 

EPAddUniformO 

add  an  uniform  noise  signal  to  an  image 

IPImageHistogramO 

return  the  histogram  of  an  image 

IPImageMaxO 

return  the  maximum  of  an  image 

IPImageMeanQ 

return  the  mean  of  an  image 

IPImageMinO 

return  the  miniTwim  of  an  image 

IPImageMomentsO 

return  the  moments  of  an  image 

IPImageStdDevO 

return  the  standard  deviatum  of  an  image 

Image  Segmentation  Routines: 

IPThresholdO 

threshold  an  image  to  create  a  bi-level  image 

Feature  Handling  Routines: 

IPCombineFeaturesO 

combines  two  feature  lists,  deleting  migtaals 

IPDeleteFeaturePlaneO 

removes  a  feature  plane  from  a  feature  structure 

IPFeaturetoImageO 

convert  an  feature  image  to  a  intensity  image 

IPImageToFeatureO 

convert  an  intensity  image  to  a  feature  image 

IPMergeFeaturesO 

combines  two  feature  lists 

Feature  Extraction  Routines: 

IPAboveThresholdO 

counts  the  number  of  pixels  above  a  threshold  value 

IPBelowThreshoklO 

counts  the  number  of  pixels  below  a  threshold  value 

IPCoOccurCIusterQ 

computes  the  cluster  tendency  of  the  co-occurrence  probability 
matrix 

IPCoOccurContrastO 

computes  the  ctmtrast  of  the  co-occurrence  probability  matrix 

IPCoOccurCorrelationO 

computes  the  correlatioo  of  the  co-occunenoe  probability  matrix 

IPCoOccurEnergyO 

computes  the  unifonnity  of  the  energy  of  the  co-occurrence  probabil¬ 
ity  matrix 
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IPCoCyccurEntropyO 

IPCoOccurHomogeneityO 

IPCoOccurlnverseMomentO 

IPCoOccurMaxO 

IPCoOccurrenceO 

DPFeatureConvolutionO 

DPFreiEdgeO 

IPKirschEdgeO 

IPPrewittEdgeO 

IPRobertsEdgeO 

IPRobinsonEdgeO 

IPSobelEdgeO 

IPWindowMeanO 

IPWindowMomentsO 

IPWindowStdDevO 

Memory  Handling  Routines: 


conputes  the  entropy  of  the  cx)-occurrence  probability  matrix 

cmnputes  the  homogeneity  of  the  co-occurrence  probability  matrix 

ccanputes  the  inverse  difference  moment  of  the  co-occurrence  proba¬ 
bility  matrix 

asnputes  the  maximum  of  the  co-occurrcnce  probability  matrix 

compute  the  co-occurrence  probability  matrix 

create  feature  by  convolving  image  with  kernel 

performs  edge  deuxdon  ncing  the  Frei  and  Chen  operator 

perfonn  a  Kirsch  edge  detecdon 

perform  a  Prewitt  edge  detection 

perform  a  Roberts  X  edge  detection 

perform  a  Robinson  edge  detection 

perfonn  a  Sobel  edge  detection 

creates  feature  image  based  oi  mean  of  a  window 

creates  feature  images  based  cm  moments  of  an  image 

creates  feature  image  based  cm  standard  deviation  of  a  window 


allocFeaturelnitO 

addFeaturelmagesO 

freeFeatureO 

deleteFeatui^lmageO 

allocImageO 

freelmageO 

lireelmageDataO 


initializes  the  image  feature  structure 

to  add  feature  data  to  existing  feature  structure 

free  feature  structure 

delete  feature  images 

initialize  the  image  structure 

frees  the  image  structure 

delete  the  image  data 


Adding  NftwRmirines 

One  of  the  consideraticms  in  the  design  of  the  toolkit  was  to  allow  new  routines  to  be  easily  added.  This  flexibil¬ 
ity  was  added  by  providing  an  cxganized  and  easily  understood  flle  structure,  providing  Makefiles  which  simplify  the 
compiling  and  the  inclusion  of  new  routines  into  the  binary  and  finally  by  providing  useful  internal  subroutines  to 
simplify  the  implementatioo  of  common  image  processing  techniques.  Adding  a  new  routines  consists  of  the  fr '  ow- 
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ing  basic  steps: 


1.  Decide  on  the  apprqjriate  subdirectory  under  EPRoutines  for  the  new  routine. 

2.  Create  the  new  source  code,  IPNewRoutine.c. 

3.  Create  a  man  page  for  the  routine.  IPNewRoutine.3IP. 

4.  Edit  Makefile.  Add  IPNewRoutine.o  to  the  definition  of  OBJECT  at  the  top  of  the  Makefile. 

5.  Edit  the  header  file  in  the  subdirectory  to  include  the  new  routine’s  function  prototype. 

6.  Type  make  to  compile. 

7.  In  the  top  level  directory  type,  “make  clean;  make”  to  add  the  new  routine  to  the  library. 

To  help  with  steps  2  and  3  templates  and  several  subroutines  have  been  written. 

Many  image  processing  routines  are  based  on  the  concept  of  either  a  point  operation  or  a  window  operation.  For 
a  point  (qteratiai  die  ouQ)ut  at  a  pixel  is  based  only  on  the  input  pixel  value.  For  a  window  operation  the  output  is 
based  cn  a  small  neighborhood  of  values  centered  at  the  pixel  location.  For  both  of  these  types  of  operatiois  the  out¬ 
put  is  computed  by  scanning  through  the  image.  This  scanning  process  has  been  antomauvi  through  the  use  of  several 
subroutines.  These  routines  all  begin  with  the  word  Process,  and  mainly  differ  in  the  type  of  data  inpittivl  or  ouQmt- 
ted.  Their  common  t^proach  is  to  be  used  after  the  called  subroutine  has  initialized  and  setup  any  necessary  data 
structures.  They  are  called  with  parameters  for  the  input  image  (or  feature)  and  a  function  pointer  which  points  to  a 
function  to  do  the  processing  at  each  node.  The  easiest  way  to  explain  the  use  of  these  subroutines  is  by  examining  a 
sample  routine.  The  toolkit  routinelPUnSharpMaskO  is  a  window  operator  which  sharpens  imagt*.  edges  by  aHHing  a 
weighted  highpass  version  of  the  original.  The  code  using  the  Process  WindowQ  routine  is  as  follows. 

#include  <IPTdatatype . h> 

#include  <IPInternal .h> 

static  long  IWindowSize; 
static  float  fWeightParam; 
static  long  IHeightOrder; 
static  long  IWidthOrder; 

IMAGE  *IPUnSharpMask  (IMAGE  *piinageln,  u_long  ulHeight, 

u_long  ulWidth,  float  fWeight) 

1 

IMAGE  *pimageOut; 

static  PIXEL  ComputePoint ( ) ; 
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IWindowSize  =  ulHeight*uiWidth; 
fWeightParam  =  fWeight; 

IHeightOrder  =  ulHeight/2; 

IWidthOrder  =  ulWidth/2; 

pimageOut  =  ProcessWindow (pimageln,  ulHeight,  ulWidth, ComputePoint) ; 
return  (pimageOut) ; 

} 

static  PIXEL  ComputePoint (IMAGE  *pimageWindow) 

{ 

PIXEL  *ppixelData; 

PIXEL  ulAverage; 
long  lEnhanceValue; 

ppixelData  =  pimageWindow->data (0]  [0] 
ulAverage  =  pixelAvg (ppixelData) ; 
lEnhanceValue  =  (1 . 0+fWeightParam)  * 

pimageWindow->data(0] [IHeightOrder] [IWidthOrder] 
-  fWeightPareim’ulAverage; 
return (CLIP (lEnhanceValue) ) ; 

] 


The  routiiie  ProcessWindowO  scans  through  the  image  pimageln.  For  each  pixel  value  a  subimage  of  height 
ulHeight  and  width  ulWidth  is  extracted,  put  into  the  image  structure  pimageWindow,  and  passed  to  the  function 
ComputePointO  for  processing.  Ail  the  called  routine  does  is  set  up  some  global  variables,  and  the  caU  ProcessWin* 
dow()  to  handle  scanning  through  the  image.  The  function  ComputePointO  uses  only  the  window  of  passed  data  to 
compute  a  single  pixel  output  value.  This  value  is  returned  by  ComputePointO.  Thus  writing  a  new  routine  only 
requires  the  implementation  of  two  simple  subrouuoes.  The  toolkit  level  routine  is  only  used  to  initialize  global  vari¬ 
ables  and  to  call  the  appropriate  “Process”  routine.  The  processing  routine  only  needs  to  process  output  value  for  the 
passed  window. 

Internal  Routines:  The  routines  listed  below  are  provided  for  internal  use  by  todldt  routines,  hi  addition  to  the 
“Process”  routines  described  above,  several  common  and  useful  processing  tasks  have  been  included.  To  use  in  new 
routines,  include  the  header  file  IPIntemal.h  in  your  source  code.  Details  of  their  use  can  be  found  by  looking  at 
their  source  code. 

CoOccnrrenceO  fill  the  co-occurrence  matrix 
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EiTor() 

ProcessFeaturePointO 


print  cnat  error  message 


ProcessFeaturePointO  compute  feature  from  image  point 

ProcessFeatureWindowO  compute  feature  from  image  window 

ProcessFeatureWindowFeature  compute  feature  from  feature  window 
ProcessMultFeatureWindowO  compute  multiple  features  from  image  window 


ProcessPointO 

Process  WindowO 

floatAvgO 

floatMaxO 

floatMinO 

floatSortO 

noiseO 

pixelAvgO 

pixelMaxQ 

pixelMinO 

pixelSortO 


compute  image  from  image  pdnt 
compute  image  from  image  window 
average  vector  of  floats 
find  maximum  of  a  float  vector 
find  minimum  of  a  float  vector 
sort  a  vector  of  floats 
generate  a  rantkon  number 
average  vector  of  pixels 
find  maximum  of  a  pixel  vector 
find  minimum  of  a  pixel  vector 
sort  a  vector  of  pixels 


The  Image  Processing  Toolkit  was  conceived  and  implemented  to  ease  the  implonentation  of  high-level  user 
^plications.  An  integral  part  of  this  toolkit  is  the  image  processing  routines  which  provide  basic  image  processing  to 
the  high-level  ^plication  programmer.  The  objective  of  this  initial  work  was  to  provide  for  three  essential  canqx>- 
nents  for  image  processing,  they  are: 

1.  Easy  to  understand  and  use  data  structures  and  file  structures  so  that  the  toolbox  routines  can  be 
used  by  an  t^plicatico  programmer. 

2.  Implementation  of  a  basic  set  of  image  processing  routines.  This  provides  an  example  set  of  rou¬ 
tines  on  which  future  wok  can  build. 

3.  Implementation  of  a  mechanism  so  that  the  typical  imagp.  processing  routine  can  be  quickly  imple¬ 


mented. 
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This  initial  work  has  reached  these  goals.  Simple  but  flexible  image  and  feature  structures  have  been  defined,  over  70 
basic  image  processing  routines  have  been  implemented,  tested,  and  documented,  and  finally  through  the  use  of  sub¬ 
routine  calls  using  function  pointers,  a  mfr-hanism  for  the  simple  implementatiop  of  pointwise  and  window  (^)erators 
has  been  included. 
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TERRAIN-SCATTERED  RADAR  CLUTTER  SUPPRESSION  USING  SIGNAL  RESTORATION 


John  F.  Doherty 
Assistant  Professor 

Department  of  Electrical  Engineering  and  Computer  Engineering 


Abstract 

This  report  presents  research  results  on  a  new  method  for  the  suppression  of  terrain-scattered 
clutter  due  to  intentional  jamming  of  the  radar  footprint.  The  new  method  applies  signal 
restoration  processing  on  the  conventional  beamforming  output.  The  temporal  processing  of  the 
received  radar  signal  attempts  to  restore  the  known  properties  of  the  transmitted  radar  waveform. 
This  approach  relies  on  the  transmitted  radar  signal  having  coded  phases,  such  as  a  discretized 
chirp  or  polyphase  waveform.  The  properties  of  the  code  are  used  to  distinguish  the  desired  radar 
return  from  the  jamming  and  clutter  interferences.  Because  of  the  assumed  prior  knowledge  of  the 
transmitted  waveform,  it  is  not  necessary  to  have  an  auxiliary  record  of  the  jamming  signal.  The 
signal  processing  is  also  iterative,  requiring  no  sensor  correlation  matrix  formation  or  inversion. 
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TERRAIN-SCATTERED  RADAR  CLUTTER  SUPPRESSION  USING  SIGNAL  RESTORATION 


John  F.  Doherty 

1  INTRODUCTION 

The  radar  signal  processing  problem  considered  in  this  report  is  the  suppression  of  terrain- 
scattered  jamming  (clutter)  in  the  main  beam  of  the  radar  array  response.  The  clutter  is  usually 
distributed  such  that  spatial  nulling  in  the  main  beam  results  in  a  significant  loss  in  target 
detection  [1],  The  alternative  approach  is  to  temporally  process  the  sensor  outputs  by  estimating 
the  clutter  delay  profile  and  subsequently  subtracting  the  clutter  estimate  [1,2].  These  approaches 
are  limited  by  the  requirement  of  a  clutter-free  record  of  the  jamming  signal.  This  usually 
assumes  an  auxiliary  antenna  directed  at  the  jammer.  In  addition,  there  is  either  an  explicit  or 
implicit  calculation  of  the  sensor  matrix  and  its  inverse.  This  may  limit  the  applicability  of  such 
approaches  to  problems  of  small  dimension,  for  example,  arrays  with  a  small  number  of  sensors. 
The  performance  gains  of  such  space-time  adaptive  approaches  to  experimental  airborne  pulsed 
radar  data  was  recently  demonstrated  for  systems  with  relatively  low  processing  complexity  [3]. 
The  new  approach  differs  from  these  in  three  ways:  it  does  not  rely  on  a  clean  look  at  the  jamming 
signal,  it  does  not  require  correlation  matrix  calculation  or  inversion,  and  it  operates  on  the 
uncompressed  pulse. 

The  new  approach  to  clutter  suppression  is  essentially  one  of  signal  property  restoration  based 
upon  known  properties  of  the  desired  signal.  Undesired  signals  which  may  be  present  include 
direct  path  jamming,  jamming  via  multipath  clutter,  clutter  returns  due  to  the  radar,  and  ambient 
thermal  noise.  The  "art"  of  the  signal  restoration  approach  is  to  determine,  or  construct,  the  unique 
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properties  of  the  desired  signal  which  enable  suppression  of  the  undesired  signals. 

The  class  of  transmitted  signals  studied  are  the  polyphase  coded  pulse  compression  waveforms. 
This  class  encompasses  the  direct-sequence  and  discretized  chirp  signals.  An  important  distinction 
of  the  new  approach  is  that  the  signal  processing  occurs  before  the  pulse  compression.  This  allows 
greater  latitude  in  utilizing  the  desired  signal  properties  to  enhance  the  suppression  process. 

The  discrete-time  signal  that  is  used  for  analysis  can  be  derived  from  the  general  transmitted 
pulse  signal 

sit)  =  ait)  cos{2nf.t  +Q(t)) 


where  ait)  represents  the  amplitude  modulation  function  (assumed  constant  for  the  remainder  of 
this  report),  is  the  carrier  frequency,  and  Qit)  is  the  phase  modulation  function.  The  low  pass 
equivalent  representation  of  sit),  i.e.,  the  complex  baseband  signal,  is  given  by 

sit)  =  ait)(cosiQit))  +j  sin{0(<))) 

The  pulse  has  duration  denoted  by  T  time  units.  The  phase  function  has  the  form 

N 

Qit)  =  exp[7(l)„]  (3) 


where  the  minor  pulse  duration  is  =  TIN .  The  pulse  pit)  is  zero  outside  the  interval 
0  <  t  <  T^.  The  output  samples  of  a  matched  filter  (matched  to  ait)  pit))  sampled  at  the  rate 
1  /  T^,  are  given  by  the  constants. 
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Vk  =  exp[j>J 


(4) 


It  should  be  noted  here  that  pulse  compression  has  not  yet  occurred.  Pulse  compression  can  only  be 
achieved  by  multiplication  by  the  phasors 

1.1  Received  signal  model 

The  received  signal  at  the  output  of  the  k  sensor  matched  filter  at  time  t  =  nT^  is  represented 
by 

=  v^{n)  +  qi.(n)  +  c^{n)  +  W/^in)  (5) 

where  the  disturbances  are  (7*(n),  the  jammer  signal,  c^(n),  the  terrain-scattered  clutter  signal, 

and  the  wideband  thermal  noise.  In  general,  all  the  signal  samples  will  be  complex  valued. 

The  jammer  is  assumed  time  uncorrelated  and  sensor  correlated.  The  clutter  is  time  correlated 
over  a  pulse  duration  and  sensor  uncorrelated.  The  thermal  noise  is  time  and  sensor  uncorrelated. 
These  properties  will  be  used  to  discriminate  between  the  desired  signal,  V/^in),  and  the 
disturbances. 

1.2  Space-time  signal  model 

The  space-time  signal  matrix  will  have  the  form 


T 

Din)  = 

... 

= 

... 

r^n) 

• Kji 

21-5 


where  -  \rJ^rL-mN^)  r^{n-l-mN^)  ...  r^{n-N-mN^)j  is  the  complex  received  sample 

vector  for  the  k  sensor  during  the  m  pulse  period.  The  pulse  repetition  period  is  =  N^T^. 


1.2.1  Target  signal  vector  component 


The  target  embedded  in  r^(n)  will  be  at  a  spatial  angle  0^,  with  power  P„  and  Doppler  shift  so 
that  the  desired  signal  vector  is  given  by 


v^(n) 


exp{  -i  27c  (^  - 1 )  rf,  sine^  exp[;’  2  tt  0/^ }  exp[/'  c)) - 1) 

exp{-j2n:  (^-l)d,sineJexp[/27:  l/‘rf}exp(/({)^„„j^v.2} 


_exp(-72n:  (A-l)d^sine4exp{i2n  (A^-l)/;}exp(;(})^^,^iv-v}, 


(7) 


The  autocorrelation  function  of  Uf^{n)  will  depend  on  both  the  spatial  index  k  and  the  time  index 

n. 

1.2.2  Jammer  signal  vector  component 

The  jammer  signal  portion  of  the  received  vector  at  spatial  angle  0^  with  power  is  represented 
by 
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g*(n)  = 


2 


exp{-y27:  (k-l)d^smd^){q^^{n)  +jgi*(n)} 
exp{  -y  2k  ik  - 1  )d,  sine  J  {g^^(n + 1)  +y  + 1) } 

exp{-y27r  (^-l)rf^.sineJ{g^^(n+MAO  -t-jq^^in+MN)] 


(8) 


where  Q^kin)  and  are  i.i.d.  Gaussian  random  sequences  with  zero  mean  and  unit  variance. 

Thus,  the  sequence  q  will  exhibit  only  spatial  correlation. 

1.2.3  Clutter  signal  vector  component 

The  terrain  clutter  vector  component,  C/^in),  will  be  formulated  as  the  response  of  a  discrete-time 
linear  filter  to  an  i.i.d.  Gaussian  random  sequence. 


c*(n) 


2 


Crk^n)  +jc-,^(n) 

c,.t(n+l)+yc;*(n  +  l) 


c^^(.n  +MN)  + j  +MN) 


(9) 


where  sets  the  power  of  the  clutter.  The  filtering  will  result  in  the  output  process  having  a 
Gaussian  autocorrelation  function  that  specifies  the  relative  Doppler  frequency,  f^,  and  relative 
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bandwidth,  B^^,  of  the  clutter  power  spectral  density,  viz.. 


p^{m,k')  =  E{c^in)c^,{n+m)] 


5ik-k')  exTp[-j2Kf^m}  exp(-2[7TS,^m]'^} 
0 


I  TO  I  <  N 
ItoI  >  N 


(10) 


where  bik)  is  the  Kronecker  delta  function  which  equals  unity  when  k  =  0,  otherwise  it  equals 
zero.  The  clutter  is  not  correlated  from  pulse  to  pulse. 


1.2.4  Thermal  noise  component 

The  thermal  noise  will  be  uncorrelated  in  both  time  and  space  and  will  have  power 

w^^(.n)+Jiv^f^{n) 
w^^{n  +  l)+jw^^{n+l) 

w^^{n  +MN)  +jw^^{n  +MN) 


wdn)  = 


P., 

2 


where  lOi.i^(.n)  and  are  i.i.d.  Gaussian  random  sequences  with  zero  mean  and  unit  variance. 


2  METHODOLOGY  OF  SIGNAL  RESTORATION  APPROACH 


The  received  signal  vector,  r^(n),  belongs  to  a  vector  space  of  dimension  MxN .  The  desired 

signal  vector  will  be  restricted  to  a  subspace  of  this  vector  space.  Thus,  any  components  in  the 
received  signal  vector  that  are  outside  the  desired  signal  subspace  can  be  discarded  without  a  loss 
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in  the  fidelity  of  the  desired  signal.  The  restoration  approach  relies  on  performing  operations  on 
the  received  signal  vector  which  attempt  to  retain  only  the  desired  signal  components. 

The  process  of  signal  restoration  used  is  iterative,  where  each  iteration  results  in  a  vector  which  is 
closer  to  the  desired  signal  vector.  The  iterative  implementation  avoids  such  problems  as 
construction  and  inversion  of  the  received  signal  correlation  matrix.  The  updates  are  usually  a 
mix  of  linear  and  non-linear  operations.  The  non-linear  updates  are  guaranteed  to  converge  if  they 
are  convex,  e.g.,  they  describe  spheres  in  the  vector  space.  A  weaker  condition  is  that  the  update 
be  non-expan sive,  i.e.,  the  updated  vector  is  guaranteed  to  have  finite  length.  The  non-expansive 
condition  only  guarantees  that  the  updates  will  not  diverge.  Some  properties  of  the  desired  signal 
that  are  used  to  formulate  the  update  process  are  amplitude  (time  and  frequency),  energy,  and 
correlation. 

2.1  Signal  properties  used  for  updates 

A  set  of  update  equations  are  needed  which  selectively  suppress  the  undesired  components  of  the 
received  signal.  Each  update  can  be  supplied  sequentially,  converging  to  the  desired  target  signal 
estimate.  The  update  equations  will  rely  on  using  known  propertied  of  the  desired  received  signal 
which  are  not  generally  descriptive  of  the  undesired  disturbances. 

2.1.1  Amplitude 

The  desired  signal  may  be  bounded  in  amplitude  in  either  the  time  or  frequency  domain.  Any 
estimate  can  be  constrained  to  adhere  to  this  condition  using  the  update 
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0,Jn)  =  1 


max 


(12) 


t),(n) 


if  vfn)  > 
if  vfn)  < 


where  is  the  expected  maximuTn  for  the  desired  signal  vector. 


2.1.2  Energy 


If  the  desired  signal  can  be  bounded  in  energy,  then  the  following  update  is  appropriate 


V, 


ViV^ 


if  v'!vi  > 


(13) 


where  {'Y^  is  the  complex  conjugate  (Hermitian)  transpose  operation. 


2.1.3  Linear  updates 

The  previous  updates  are  considered  non-linear.  A  linear  update  can  be  achieved  using  the 
formula 

=  (5,  +  p[d-ar"f),]  (14) 

v,v, 


where  the  scalar  d  and  the  vector  x  are  chosen  to  correspond  to  some  property  of  the  vector  v^. 


The  scalar  pi  will  determine  how  well  the  estimate  satisfies  the  conditions.  For  example,  if  we 
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know  that  uin)  -  Uy  ,  then  the  update  vector  is  given  by  x  =  1  1  ...  1  and  the  estimate 


vi.i  =  +  [uv 

ViVi 


(15) 


will  result  in  Vi^iM  =  u-^ . 


All  the  preceding  updates  can  be  applied  to  the  received  data  iteratively.  In  general,  it  will  take 
only  one  pass  through  all  the  updates  to  provide  reasonable  clutter  suppression  performance. 

2.2  Polyphase  codes 

Pulse  compression  based  upon  polyphase  waveform  transmission  is  assumed.  A  major  departure 
from  current  techniques  is  that  signal  processing  occurs  before  pulse  compression  takes  place  (see 
Figure  1).  The  pre-compression  processing  allows  use  of  the  phase  code  properties  in  suppressing 
the  clutter.  That  is,  the  pre-compression  waveform  has  structure  that  can  be  used  before  it  is  lost 
in  the  compression  operation.  There  are  several  possible  polyphase  waveform  that  can  be  used  in 
pulse  compression  radar.  Two  possibilities  are  direct  sequence  codes  and  Frank  codes  [4].  Frank 
polyphase  codes  are  of  particular  interest  because  they  represent  discrete  approximations  to  linear 
FM  waveforms.  The  remainder  of  this  report  will  assume  the  use  of  Frank  codes,  although  the 
principles  of  the  signal  restoration  processing  apply  to  any  polyphase  waveform. 

The  Frank  polyphase  codes  can  be  derived  from  the  complex  exponentials  used  in  the  discrete 
Fourier  transform  (DFT).  Given  the  DFT  of  length  N,  the  Frank  code  is  given  by 

where  the  codes,  Pj^,  are  the  elements  of  the  DFT  matrix  of  dimension  N'^. 
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p*  =  exp{-J23x mod(fe^-l)  modik/iN-DJ^-l)  k-l,2,...N'^ 


(16) 


2.3  Clutter  suppression  using  temporal  decorrelation 

The  model  of  the  terrain-scattered  jamming  clutter  is  that  it  is  temporally  correlated,  due  to  the 
multiple  scattering  of  the  ground,  up  to  a  delay  equal  to  the  PRI.  The  transmitted  polyphase 
waveform  exhibits  strong  correlation  only  at  delay  intervals  equal  to  multiples  of  the  PRI.  This 
property  leads  to  the  "thumbtack"  ambiguity  function  for  polyphase  waveforms.  A  conclusion  is 
that  temporal  correlation,  at  delays  other  than  multiples  of  the  PRI,  in  the  received  radar  signal 
is  primary  due  to  the  terrain -scattered  clutter.  Thus,  applying  a  decorrelation  filter  prior  to  pulse 
compression  will  remove  a  significant  portion  of  the  clutter  energy  (see  Figure  1).  Of  course,  the 
polyphase  waveform  is  also  affected,  however,  to  a  lesser  degree.  The  decorrelation  filter  length 
should  be  less  than  the  length  of  the  polyphase  code  (PRI)  to  avoid  code  cancellation.  This 
condition  is  easily  satisfied  in  most  situations.  The  decorrelation  filter  can  be  a  simple  linear 
predictive  structure 

.V, 

Ok  *  -  E  [Ck-n-^Vk-n*^k-n] 


where  the  bracketed  terms  are  the  available  data.  The  error  in  the  prediction  process  is  actually 
an  estimate  of  the  radar  return  plus  noise,  since  these  components  are  not  predictable  compared  to 

the  clutter.  It  should  be  noted  that  the  prediction  filter  weights,  ,  can  be  determined  by  an 

appropriate  adaptive  algorithm  without  the  need  for  an  external  reference  signal.  That  is,  an 
advantage  of  processing  the  signal  prior  to  pulse  compression  is  that  the  desired  polyphase 
component  appears  uncorrelated  and  is  unaffected  by  the  decorrelation  filter. 


21-12 


3  RESULTS 


A  set  of  simulation  experiments  were  conducted  which  demonstrate  the  performance 
improvements  using  signal  restoration  processing.  The  adjustable  system  parameters  are  the 
number  of  polyphase  codes  per  pulse,  pulses  available  for  processing,  and  linear  array  of  sensor 
elements.  The  performance  criterion  of  primary  interest  is  the  output  signal-to-residual- 
interference  (SINK)  power  ratio.  The  residual  interference  is  defined  as  the  sum  total  of  all  the 
direct  path  jamming,  terrain-scattered  jamming,  and  thermal  noise  that  remains  after  processing, 
at  the  input  of  the  detector.  Other  parameters  of  the  simulations  are  the  direct  path  jamming 
power,  the  direct  path  jamming  direction,  the  clutter  bandwidth,  the  clutter  power,  the  clutter 
mean  Doppler  frequency,  and  the  target  Doppler  frequency.  The  power  levels  are  determined 
relative  to  the  target  power,  assumed  unity  for  convenience.  The  Doppler  frequencies  are 
determined  relative  to  the  center  frequency  of  the  radar  and  are  in  the  units  of  cycles-per-sample. 
The  target  direction  is  assumed  to  be  orthogonal  to  the  array,  i.e.,  at  boresight. 

Simulation  1.  A  first  look  at  the  signal  restoration  process  is  given  in  Figure  3.  This  figure  shows 
the  power  spectral  density  (PSD)  before  and  after  clutter  suppression.  The  clutter  parameters  are 
center  frequency  0.4Hz,  bandwidth  O.lHz,  and  relative  power  20dB  with  respect  to  the  target 
signal.  The  number  of  pulses  processed  is  eight  (8)  with  32  polyphase  codes  per  pulse.  The  direct 
path  jamming  is  assumed  to  be  zero  for  this  simulation.  The  PSD  of  the  processed  signal  shows 
better  registration  with  the  desired  signal  PSD  than  does  the  unprocessed  signal.  This  is  the 
contributing  factor  in  the  subsequent  demonstrations  of  output  SINK  improvement. 

Simulation  2.  The  next  simulation  consists  of  32  pulses,  256  polyphase  codes/pulse,  and  8  array 
sensors.  The  clutter  bandwidth  is  0.1  Hz  with  power  20dB.  The  jammer  is  at  45  degrees  from  the 
array  boresight  with  power  40dB.  The  target  Doppler  frequency  ranges  from  zero  to  a  maximum  of 
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0.01  cycles/sample.  This  maximum  Doppler  is  a  typical  value  for  maximum  velocity,  minimum 
wavelength,  and  low  bandwidth  conditions  [1],  The  output  SINK  improvement  is  shown  in 
Figure  4. 

Simulation  3.  The  parameters  of  this  simulation  are  identical  to  those  of  Simulation  2,  except  that 
the  clutter  bandwidth  is  reduced  to  0.03  Hz.  The  results  are  shown  in  Figure  5. 

Simulation  4.  In  this  simulation,  the  number  of  pulses  is  128,  the  number  of  codes/pulse  is  64,  and 
the  number  of  sensors  is  again  eight  (8).  The  results  are  shown  in  Figure  6. 

4  CONCLUSIONS 

A  new  method  for  terrain-scattered  jamming  suppression  is  reported.  This  new  method  relies  on 
processing  a  coherent  pulse  train  of  polyphase  coded  waveforms  prior  to  pulse  compression.  The 
pre-compression  signal  properties  are  used  for  decorrelation  and  signal  restoration.  The  average 
increase  in  SINK  is  approximately  6dB  across  Doppler  frequency.  The  signal  processing  used  does 
not  involve  correlation  matrix  calculation  or  inversion.  Also,  each  pulse  can  be  processed 
separately,  then  combined  prior  to  detection.  This  makes  the  new  approach  amenable  to  parallel 
processing  architecture  implementation.  Future  extensions  of  this  work  can  be  the  use  of  the 
partial  correlation  properties  of  the  polyphase  codes  and  the  analysis  of  existing  pulse  compression 
radar  data  containing  terrain-scattered  clutter. 
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Figure  1  System  block  diagram  showing  major  signal  processing  functions. 


Figure  2  Signal  properties  can  be  translated  into  projection  surfaces.  A  sequence  o: 
projections  onto  these  surfaces  converges  to  an  intersection,  describing  the  desired  signal. 
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Figure  3  The  power  spectral  density  comparison.  The  true  PSD  is  given  by  the 
clutter  +  signal  PSD  ’ - and  the  restored  PSD 


Figure  4  Output  SINK  improvement  of  new  approach  compared  to  matched  filter 
performance.  System  parameters:  32  pulses,  256  codes/pulse,  8  sensors.  Clutter  bandwidth 
is  0.1  Hz. 


Target  Doppler  Frequency  (tycles/sample)  xlO 


Figure  5  Output  SINK  improvement  of  new  approach  compared  to  matched  filter 
performance.  System  parameters;  32  pulses,  256  codes/pulse,  8  sensors.  Clutter  bandwidth 
is  0.03  Hz. 


Figure  6  Output  SINK  improvement  of  new  approach  compared  to  matched  filter 


performance.  System  parameters;  128  pulses,  64  codes/pulse,  8  sensors.  Clutter  bandwidth 


is  0.1  Hz. 
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Abstract 

A  cadmium  telluride  (CdTe)  optic  fiber,  and  a  fast  electronic 
switching  circuit,  were  constructed  and  studied  for  use  in  an 
Optical  Time-Domain  Multiple-Access  (OTDMA)  network.  The  fiber  was 
to  be  used  in  constructing  a  broad-band  optical  amplifier.  The 
switching  circuit  was  integrated  into  a  lithium  niobate  electro¬ 
optic  carrier  modulator. 

The  fiber  core  had  a  CdTe-glass  hyperlattice  structure.  The 
diameter  of  the  glass-core  was  20  microns  surrounded  by  a  CdTe 
cylindrical  ring  of  thickness  20  armstrongs.  The  absorption  and 
transmission  spectra  of  the  fiber  were  characterized.  The  results 
were  compared  with  those  of  bulk  CdTe  with  remarkable  difference 
between  the  two.  The  absorption  spectra  for  the  fiber  was  broad¬ 
band  and  was  red-shifted  in  comparison  with  the  bulk. 

The  switching  circuit  had  fast  rise-  and  fall-times  of  2 
nanoseconds  and  converted  TTL  logic  gate  levels  (as  inputs)  to  10 
volt,  200  milliamp  output  levels  for  the  electro-optic  modulator. 
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Characterization  Of  A  CdTe-Doped  Fiber  and  The  Design  of  A  Wave- 
Shaping  Circuit  For  Optical  TDMA  Network  Implementation. 

Eric  Donkor 


I .  Introduction 

Optical  time-division  multiple-access  (OTDMA)  networks  and 
packet  switching  architectures  are  currently  being  investigated  and 
implemented,  at  the  Photonic  Research  Center  of  Rome  Labs,  for 
high-speed  information  transfer  and  distribution  in  multi-user 
environment  with  intended  applications  in  integrated  systems 
digital  networks,  local  area  and  micro  area  networks.  A  five 
gigabit/second  50-channel  optical  time-division  multiple-access 
interconnect  has  already  been  demonstrated  (see  R.  K.  Boncek  et. 
al..  Optical  Engineering,  pp  2442,  Nov  '92).  Efforts  to  implement 
even  faster  architectures  are  under  investigation.  The  speed  of 
operation  is  currently  limited  by  the  performance  of  the 
electronic  devices  and  circuits  that  are  used  to  implement  some  of 
the  various  subsystems  of  these  architectures. 

Two  approaches  are  being  researched  at  Rome  Labs  to  circumvent 
such  electronic  bottlenecks.  In  one  approach,  research  is  directed 
at  the  design  and  implementation  of  all-optical  network 
architectures.  Consequently,  several  projects  are  under  way  to 
produce  optical  materials,  devices,  and  circuits  that  will 
ultimately  be  used  for  such  all-optical  time-division  multiple- 
access  network  implementation.  They  include  photostable  ultra-fast 
optical  switches  and  switching  networks,  tunable  optical  delay 
devices,  high  bandwidth  optical  modulators,  and  novel 
semiconductor-glass  composite  fiber  optic  amplifiers. 

Another  approach,  that  already  has  produced  promising  results, 
is  directed  at  the  design  and  implementation  of  hybrid 
electronic/photonic  TDMA  architectures.  In  this  hybrid  approach 
optical  techniques  are  used  to  time-division  multiplex  signals  from 
all  users  unto  a  shared  optical  link.  The  multiplexed  signal  is 
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broadband  broadcasted  to  all  receivers.  The  data  transmission  rate 
typically  run  in  the  multi-gigabit  range.  Consequently, 
conventional  electronics  will  be  inadequate  to  directly  process 
these  signals  at  the  receiving  ends. 

Thus  at  each  receiver,  optical  techniques  are  employed  to 
demultiplex  and  extract  the  desired  baseband  data  from  the 
broadband  signal.  The  baseband  signal  can  then  be  processed  with 
electronic  devices  and  circuits.  Critical  devices  required  to 
implement  this  architecture  include  tunable  electro-optic 
modulators  and  photoconduct ive  device  that  can  be  used  to  perform 
optical  correlation  functions,  and  fast  pulse-shaping  electronics. 

This  summer,  July  5  to  September  3,  I  participated  in  two 
research  projects  at  Rome  Labs.  One  was  to  characterize  the  optical 
properties  of  a  cadmium  telluride-glass  composite  fiber.  In 
particular  we  were  interested  in  the  occurrence  of  quantum  size 
effect  phenomena  and  other  optical  nonlinear  effects  in  such 
fibers.  In  the  second  project,  a  fast  electronic  switching  and 
pulse-shaping  circuit  was  implemented.  It  was  intended  to  be 
integrated  into  a  tunable  electro-optic  modulator  subsystem  for  a 
hybrid  TDMA  architecture. 

This  report  summarizes  my  research  activities  at  Rome  Labs.  It 
is  divided  into  four  sections.  Section  I  to  Section  IV,  of  which 
this  introduction  part  is  Section  I.  Section  II  reports  on  work 
relating  to  the  CdTe-glass  composite  fiber,  and  Section  III 
describes  the  implementation  of  the  electronic  subcircuit  for  the 
electro-optic  modulator  subsystem  of  the  hybrid  TDMA.  The  last 
section.  Section  IV  covers  the  conclusions  of  the  work. 

II.  Optical  Characterization  of  CdTe-Glass  Fiber. 

Al.  Description  of  The  Fibr^r  Structure 

The  CdTe-glass  fiber  i  is  a  composite  CdTe-glass  core  and  a 
glass  cladding.  The  preform  was  fabricated  by  first  depositing  a 
CdTe  film  on  the  entire  outer  diameter  of  a  pyrex  glass  rod  at 
elevated  temperatures.  This  was  then  placed  inside  another  pyrex 
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glass  tube  and  shrunk  to  produce  a  preform.  The  preform  was  about 
9mm  in  total  diameter.  The  inner  glass  core  of  the  preform  was 
about  4mm  in  diameter.  The  CdTe  (circular)  film  has  a  thickness  of 
0.15/xm  and  the  outer  glass  has  a  thickness  of  2.5mm.  A  fiber  was 
pulled  from  the  preform.  The  fiber  core  has  an  inner  glass  rod  of 
diameter  3  0^im  surrounded  by  a  15-20  armstrongs  thick  C1Te 
cylindrical  ring  (or  film)  .  The  cladding  was  about  20^m  thick.  T  e 
CdTe  film  has  a  higher  index  of  refraction  than  the  surrounding 
cladding  and  as  a  result  much  of  the  incident  laser  beam  is 
confined  near  the  core  region. 

A2 .  Experimental  Procedure 

Experiments  were  conducted  to  measure  the  transmission  and 
absorption  characteristics  of  both  the  composite  fiber  and  the 
.15/xm  CdTe  ring  in  the  preform.  The  set-up  used  for  the 
measurements  is  shown  in  Fig.l.  A  100  watt  white  light  was  used  as 
the  source.  A  slit  was  placed  in  front  of  the  source  in  order  to 
control  the  width  of  the  emerging  beam. 

The  device  (fiber)  unoar  test  was  placed  between  the  two 
microscope  objectives  Ml  and  M2,  each  objective  has  a  numerical 
aperture  of  0.35  and  a  magnification  index  of  20.  A  converging  lens 
(KBX  064)  was  used  to  focus  the  light  from  M2  unto  the  detector  of 
a  wide-band  optical  spectrum  analyzer  (Ando  AQ-1425) .  Two  ends  of 
the  composite  fiber  were  cleaved  and  mounted  in  a  Newport  MF  916 
fiber  holder.  The  total  length  was  10cm.  The  mounted  fiber  was  then 
inserted  between  Ml  and  M2.  The  various  components  of  the  set  up 
were  adjusted  till  a  maximum  reading  was  displayed  and  recorded  as 
the  measured  data.  The  experiment  was  repeated  with  the  fiber  now 
removed.  The  resulting  display  was  recorded  as  the  reference  data. 
The  transmission  and  absorption  spectra  were  extracted  from  these 
measurements . 

A3.  Results  and  Discussion 

Fig. 2  shows  the  transmission  spectra  response  of  the  10cm 
CdTe-fiber  over  a  wave-length  interval  of  380nm  to  ISOOnm.  A  broad 
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Fig.  2  Optical  Spectra  of  CdTe  Fiber 
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Fig.  3  Optical  Spectrum  of  0.15um  thick  CdTe  Ring 
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Fig.  4  optical  Transmission  in  CdTe  Fiber.  (Fiber  length  10cm) 
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Fig.  5  Transmission  of  0.15um  thick  CdTe  Ring 
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peak  occurs  at  664  nm.  The  dip  at  1400ni[i  is  believed  to  be  due  to 
OH"  absorption.  A  similar  spectral  response  for  the  O.lS/xm  CdTe 
film  in  the  preform  is  shown  in  Fig  3.  It  has  very  low  response  at 
wavelengths  between  380nm  and  807nm.  and  a  flat  response  between 
807nm  to  1800nm. 

The  absorption  spectra  for  the  composite  fiber  and  the  CdTe 
film  are  shown  in  Fig. 4,  and  fig. 5  respectively.  In  fig. 4  we 
observe  a  broadband  high  absorption  region  for  the  fiber  in  the 
wavelength  interval  between  800nm  to  1400nm.  Fig. 5  depicts  a  high 
absorption  region  for  the  CdTe  film  in  the  region  between  380nm  to 
860nm  and  transmission  beyond  900nm. 

The  response  of  the  O.lS/im  CdTe  film  is  analogous  to  that  of 
bulk  CdTe.  However,  the  results  obtained  for  the  fiber  is 
remarkably  different  with  a  number  of  peculiarities.  In  particular, 
the  composite  fiber  shows  absorption  at  wavelength  higher  than  what 
would  be  expected  in  the  bulk  or  thin  film.  The  red-shift  in  the 
absorption  spectra  could  be  due  to  quantiam  size  effect  or  nonlinear 
optical  effects,  arising  from  the  generation  of  polaritons  or 
plasmon  waves.  Further  experimentation  will  be  needed  in  order  to 
reveal  the  origin  or  mechanisms  that  give  rise  to  the  red-shift  in 
the  absorption  spectrum. 

III.  Wave-Shaping  Circuit  For  Modulator  Subsytem  of  TDMA. 

Bl.  A  Block  Diagram  of  the  Hybrid  TDMA  Architecture. 

A  simplified  description  of  the  time  domain  multiple  access 
network  is  given  here.  Fig. 6  gives  a  block  diagram  of  the  TDMA 
network.  The  mode-lock  laser  generates  a  train  of  pulses,  whose 
width  and  repetition  rate  defines  the  maximum  and  baseband  data 
transmission  rate  of  the  architecture.  The  output  of  the  laser  is 
distributed  by  a  IXN  splitter  to  fixed  N-1  delay  lines.  Each  user 
is  dedicated  one  of  the  N-1  delay  lines.  A  fixed  delay  exists 
between  successive  lines.  The  last(N-th)  laser  source  is  also 
routed  to  a  IXN  splitter.  The  output  of  the  second  splitter  are 
routed  directly  to  the  receivers  to  be  used  as  clocking  signals. 

The  dedicated  lines  serve  as  carriers  to  be  modulated  with 
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user  baseband  signals.  The  modulated  signals  from  users  are 
multiplexed  unto  a  shared  transmission  link.  The  multiplexed 
signals  together  with  the  clocking  signal  are  distributed  to  all 
receivers.  A  transmitter  is  tuned  in  at  a  given  receiver  by  optical 
means.  Optical  demultiplexing  recovers  the  baseband  signal  which 
are  then  photodetected  and  processed  electronically. 

B2 .  Description  of  the  Electro-Optic  Modulator  Subcircuit. 

Modulation  of  the  carrier  with  baseband  signal  is  achieved  by 
electro-optic  techniques .The  key  element  is  a  lithium  niobate 
electro-optic  modulator.  Typical  drive  signals  are  8-12  volt  and 
200  mA  into  a  50  ohm  impedance.  However  the  logic  circuits  and 
systems,  that  generate  baseband  signals,  can  only  output  up  to  4 
volts  and  source  l0-50mA.  A  pulse  shaping  and  switching  circuit  was 
therefore  constructed  to  up-convert  logic  output  levels  to  the 
desired  voltage  and  current  levels  for  the  lithium  niobate  electro¬ 
optic  modulator. 

The  pulse-shaping  circuit  is  shown  in  fig  7.  The  transistors 
Q1  and  Q2  are  enhancement  mode  FETs  with  threshold  voltages  of  2.5 
volts.  Input  signals  for  the  circuit  are  either  address  encoding 
signals  or  user  data  signals.  These  signals  are  fed  to  the  gate  of 
Ql. 

When  an  input  signal  is  at  logic  high  it  turns  on  Ql.  In  this 
case  the  voltage  at  the  source  of  Q2  will  be  pulled  to  ground 
through  Ql  and  therefore  the  output  at  Q2  will  be  low.  Suppose  the 
input  now  makes  a  transition  from  the  high  state  to  a  low  state,  Ql 
will  be  turned-off.  Now  the  voltage  at  the  source  of  Q2  will  be 
the  same  as  the  drain  to  source  voltage  of  Ql.  Resistor,  R,  is 
chosen  such  that  the  source-to-drain  voltage  of  Ql  is  lOvolts, 
adequate  to  actuate  the  lithium  niobate  modulator. 
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B3.  Results  and  Discussion. 

Fig  8  shows  a  typical  reponse  of  the  wave-shapping  and 
switching  circuit  of  fig. 7.  The  lower  trace  represents  the  input 
and  the  upper  trace  the  output  of  the  circuit.  The  input  was  a  0- 
2.5  volt  50%  duty  cycle  square  wave. The  output  is  an  inverted 
square  pulse  with  voltage  swing  of  0-10  volt.  The  rise  time  of  the 
output  was  measured  as  20  nanoseconds  the  fall  time  as  5ns.  The 
rise  and  fall  times  of  the  input  were  4  nanaseconds. 


Fig.  8.  Reponse  of  The  Wave-Shapping  Circuit. 


An  overshoot  is  observed  in  the  rising  edge  of  the  output 
signal.  We  were  able  to  eliminate  this  overshoot  by  increasing  the 
value  of  resitor,  Rl.  However  this  was  achieved  as  a  trade-off  for 
lower  output  voltage  and  slower  rise  and  fall  times.  Also  the 
overshoot  could  be  reduced  by  lowering  the  applied  bias. 
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III.  Conclusions. 

The  measured  absorption  and  transmission  spectra  of  CdTe-glass 
composite  spectra  were  remarkably  different  from  bulk  CdTe.  The 
difference  appears  to  originate  from  quantum  size  effects.  Further 
measurements,  such  as  photo luminence  should  help  to  reveal  the 
source  of  this  discrepancies. 

The  switching  circuit  designed  for  the  electro-optic  modulator 
had  rise  time  of  20ns  and  fall  time  of  5ns.  The  response  time  was 
not  affected  by  the  duty  cycle  of  the  input  signal.  The  circuit 
could  convert  an  input  square  wave  with  a  swing  of  0-2 . 5  volt  to  a 
square  wave  output  having  a  voltage  swing  of  0-10  volt  and  current 
of  200mA. 
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Abstract 


Accurate,  low-noise,  wideband  phased  array  radar  antennas  depend  on  the 
use  of  true  time  delays  between  microwave  elements.  A  photonic  means  of 
generating  the  required  element-to-element  phase  shifts  was  developed  by 
Toughlian  and  Zmuda  for  systems  operating  in  the  VHF  frequency  range,  and 
extended  by  Monsay  and  Caccuitto  for  operation  in  the  radar  C  band.  The 
photonic  beamformer  is  based  on  an  acousto-optic  (AO) ,  heterodyne  optical 
system,  in  which  the  reference  beam  acts  as  a  probe  of  the  acoustic  wave  in 
the  AO  cell.  The  time  delay  of  the  acoustic  waveform  "read  out"  by  the 
optical  probe  from  various  locations  along  the  length  of  the  AO  cell  can  then 
be  imposed  directly  on  the  microwave  elements  of  the  radar  beamformer, 
providing  the  correct  phase  for  each  element  at  whatever  frequency  drives  the 
AO  cell. 

It  is  apparent  that  the  benefits  of  this  photonic  system  rely  greatly  on 
the  ability  of  the  probe  beam  to  pick  out  just  the  right  phase  shift/time 
delay  from  the  acoustic  waveform  in  the  AO  cell.  It  was  suggested  in  last 
summer's  report  that  laser  instability  resulting  in  small  changes  in  beam 
direction  would  have  an  impact  on  radar  system  performance  because  of  this 
dependence.  In  the  interim,  it  became  clear  that  a  potentially  more  basic 
limitation  to  the  system  performance  could  come  from  intrinsic  diffraction 
effects  due  to  the  finite  beamwidth  of  the  probe  in  the  AO  cell.  The  latter 
effect  has  been  studied  in  detail  and  its  implications  for  design  and 
performance  of  the  photonic  beamformer  are  discussed  here. 
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FORM  AND  IMPLICATIONS  OF  THE  NONLINEAR  DEPENDENCE 
OF  PHASE  ON  FREQUENCY  FOR  AN  ACOUSTO-OPTIC  BEAMFORMER 


Evelyn  H.  Monsay 


I.  INTRODUCTION 


Within  the  past  few  years,  Toughlian  and  Zmuda  [1-4]  have  developed  an 
RF  true  time  delay  phase  shifter,  based  on  a  heterodyne  optical  system,  which 
can  act  as  a  beamformer  for  high  performance,  wideband  phased  array  radar 
systems.  Sumberg  [5]  and  Baldwin  (6)  have  investigated  means  for  integrating 
the  optical  system.  In  Monsay  [7]  and  Monsay  and  Caccuitto  [8],  the  results 
of  using  a  dichromatic  laser,  demonstrated  the  potential  of  the  photonic 
beamformer  for  providing  true  time  delays  for  high  frequency  (C  band)  radar 
systems . 

Various  potential  operational  problems  were  also  considered  [7].  Since 
the  photonic  delay  line  is  based  on  a  heterodyne  optical  system,  some  laser 
noise  suppression  is  automatically  built  in.  In  low-frequency  (VHF)  systems, 
both  reference  and  signal  beams  carry  the  same  noise  disturbances  which  can  be 
made  to  cancel  upon  coherent  interference.  The  reference  and  signal  beams  in 
the  dichromatic  laser  or  coherent  laser  system  required  for  high-frequency 
(C  band,  etc.)  operation  will,  to  the  degree  of  coherence  exhibited  by  the 
particular  laser  or  laser  system,  also  tend  to  suppress  laser  jitter  and 
drift. 

However,  it  was  suggested  [7]  that  the  most  important  potential  problem 
for  the  photonic  delay  line  would  arise  from  pointing  errors  of  the  laser  beam 
along  the  acoustic  wave  in  the  AO  cell.  Such  pointing  errors  could  arise  from 
mode  hopping  in  the  laser,  or  from  mechanical  dithering  of  the  rotating  mirror 
which  directs  the  probe  beam.  In  addition,  small,  but  not  negligible, 
oscillations  in  the  data  of  phase  versus  frequency  were  observed  in  both  low- 
and  high-frequency  photonic  delay  lines,  potentially  exaggerating  the  effect 
of  small  deviations  in  pointing  angle  on  the  accuracy  of  the  final  radar 
system.  These  "wiggles"  in  the  data  proved  to  be  highly  reproducible;  they 
were  apparently  not  random  noise,  but  some  true  physical  effect  requiring 
further  study  of  the  optical  system. 

The  study  of  the  diffraction  of  the  optical  probe  beam,  presented  in 
this  report,  has  provided  a  theoretical  understanding  of  the  "wiggles"  in  the 
data.  This  derivation,  along  with  the  modeling  of  the  effects  of  the  detector 
aperture,  will  allow  for  a  thorough  determination  of  the  impact  of  laser  and 
mirror  pointing  error  on  the  associated  radar  system  performance. 

In  Section  II,  the  basic  operation  of  the  photonic  true  time  delay  line 
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is  described,  and  the  data  on  both  low-  and  high-frequency  demonstration 
systems  are  presented  and  discussed.  The  modeling  of  the  diffraction  of  the 
AO  cell  probe  beam  is  presented  in  Section  III,  with  results  and  a  comparison 
to  data.  In  Section  IV,  the  finite  aperture  of  the  photodetector  is 
explicitly  included  in  the  equations  for  the  system  and  a  preliminary 
evaluation  of  its  impact  derived.  A  final  discussion  of  this  work  and 
recommendations  for  future  work  are  presented  in  Section  V. 

II.  BASIC  PHOTONIC  DELAY  LINE  THEORY 

Figure  1  presents  the  architecture  of  the  basic  photonic  true  time 
delay  line,  using  a  dichromatic  laser  as  for  a  high-frequency  radar  system.  , 
The  optical  configuration  is  a  standard  heterodyne  system  with  signal  and 
reference  beam  frequencies  of  fj  and  f^,  and  amplitudes  Aj  and  A„, 
respectively.  A  lens  of  focal  length  F  transforms  the  point-like  probe 
illumination  in  the  AO  cell  into  a  plane  wave.  The  AO  cell  is  used  in  the 
Bragg  mode  of  operation.  With  AO  cell  RF  frequency  offset  f„,  the  resulting 
optical  beams  are 
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where  <|)  is  an  optical  phase  shift  representing  the  accumulated  phase 
difference  of  the  two  beams  as  they  propagate  through  the  system.  In  Equation 
(2),  fx  is  the  spatial  frequency  of  the  heterodyned  output,  where  there  is  a 
unique  identification  of  spatial  frequency  with  mirror  position,  and  so  with 
the  location  of  the  probe  beam  in  the  AO  cell.  The  spatial  frequency 
equals  the  mirror  position  angle  0  divided  by  optical  wavelength  X,  and  is 
uniquely  associated  with  a  particular  mirror  position,  and  so  a  unique  time 
delay  in  the  acoustic  time  waveform.  The  heterodyned  output,  ie,  the  coherent 
addition  of  the  two  optical  beams,  has  the  units  of  optical  power  and  is  given 
by  the  square  of  the  sum  of  the  two  complex  beam  amplitudes.  This  operation 
results  in  a  constant  dc  background  illumination  plus  the  system  output  signal 
with  frequency  dependence  given  by 
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In  Equation 

Af  ={fs-fR), 
operates  in 

(  Af +  fM(max)  )  • 
given  by 


(3),  the  self-heterodyne  frequency  of  the  laser  output, 
appears,  modulated  by  the  RF  frequency  f„.  Hence,  the  system 
a  band  defined  by  lower  frequency  Af  and  upper  frequency 
Note  that  any  additional  phase  shift  imposed  on  either  beam,  here 


$=2k  [Ar+fj^]  t) +<j)  (4) 

comes  through  directly  in  the  final  detected  optical  signal. 

Figure  2  indicates  the  selective  action  of  the  system  optics  within  the 
Bragg  cell,  where  a  specific  phase  shift  can  be  "picked  off"  from  the  acoustic 
wave  in  the  cell.  The  rotating  mirror  will  establish  a  location,  and  so  a 
particular  time  delay,  along  the  acoustic  wave  relative  to  its  launch  point  in 
the  cell.  As,  f„  changes,  the  phase  of  the  acoustic  wave  at  that  fixed  point 
in  the  Bragg  cell  will  also  change,  such  that  a  linear  dependence  of  optical 
phase  on  RF  frequency  -  the  hallmark  of  the  true  time  delay  phase  shift  -  is 
established.  Hence,  for  a  wideband  RF  signal  with  upper  and  lower  frequencies 
fu  and  fi,,  respectively,  a  true  time  delay  phase  shift  for  each  frequency 
component  of  the  RF  signal  will  be  available  with  varying  magnitude  dependent 
on  which  particular  point  within  the  Bragg  cell  is  chosen  by  one,  or  more  (for 
multiple  antenna  elements),  rotating  mirror (s). 

The  previous  discussion  is  based  on  the  assumption  that  the  optical 
probe  of  the  AO  cell  is  performed  by  a  point-like  beam.  The  resulting 
dependence  of  read-out  phase  $  on  system  frequency  f  =  (Af  +  f„)  is  then 
perfectly  linear; 


V 


(5) 


However,  the  data  on  ^(f)  for  fixed  x  from  demonstrations  of  both  low- 
frequency  (Figure  3)  [4,5]  and  high-frequency  (Figure  4)  [7,8]  photonic  delay 
lines  show  very  regular  oscillations  superimposed  on  the  correct  [9]  linear 
dependence.  Additional  study  of  the  photonic  delay  line  architecture  with  a 
wide  range  of  focal  lengths,  laser  frequencies,  and  other  parameters  indicated 
that  the  oscillations  were  relatively  independent  of  the  parameter  values  over 
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a  sizeable  range  [10]. 

Two  candidate  effects  could  possibly  create  these  oscillations:  the 
windowing  effect  of  the  finite  aperture  photodetector,  and  the  diffraction 
from  a  finite-sized  (non-infinitesimal)  probe  beam.  Both  effects  were 
studied  and  found  to  contribute  to  nonlinear  behavior  in  4.  The  following 
sections  present  a  detailed  understanding  of  these  effects. 


III.  EFFECTS  OF  A  FINITE-WIDTH  PROBE  BEAM 


The  original  analysis  of  the  photonic  delay  line  assumed  an 
infinitesimal,  point-like  probe  beam  reading  out  the  phase  of  the  acoustic 
wave  in  the  AO  cell.  However,  realistically,  the  ability  to  focus  the  probe, 
beam  is  limited  by  diffraction  effects.  For  unfocussed  beam  diameter  d  and 
lens  of  focal  length  F,  the  finite  spot  size  of  the  probe  beam  in  the  AO  cell 
is  given  by  the  function 


P 


ix)  =  J^sin(^) 

TTX  do 


where  d^  =(AF)/d.  The  Fourier  transform  of  the  probe  function  is 


(6) 


P{k)  =rect  (icdg) 


(7) 


where  k=27t/A  and  A  is  the  acoustic  wavelength.  The  space  and  time  dependence 
of  the  acoustic  wave  in  the  AO  cell  is  given  by 


COS[27lf{t-— )]  =COS  [Bx-O)  t]  , 
V 


(8) 


with  p=(o/v,  V  being  the  acoustic  velocity  in  the  AO  cell,  and  using  the 
identity  cos(-x)  =  cos(x).  The  acoustic  wave  will  be  illuminated  by  the 
signal  beam  over  the  range  -a  ^  x  s  a,  ie,  by  the  aperture  function 


w{x)  =rect{  —  )  .  (9) 

a 

Hence,  the  illuminated  portion  of  the  acoustic  wave  in  the  AO  cell  can  be 
expressed  as 
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(10) 


cos  ( t)  ‘lect  ( — ) 

a 


with  spatial  transform 


2sin(  (A:-p)a)  2sin{  (k+fi)  a) 

(ic-p)  ®  (ic+p) 


(11) 


Then,  for  probe  position  Xg,  the  system  response  can  be  expressed  as 


h(Xg,  t)  =  r rect— cos  (px-ot) p(x-Xg)  dx 
J  d 


(12a) 


=rect— cos  (Pxo-ot)  *p(Xq) 


(12b) 


where  *  denotes  the  convolution  of  the  probe  function  with  the  illuminated 
portion  of  the  acoustic  time  waveform  in  the  AO  cell.  Since  the  Fourier 
transform  of  the  convolution  of  two  functions  is  the  product  of  their  Fourier 
transforms,  it  is  easiest  to  evaluate  the  system  response  from  the  form 

h{Xg,  C)  =9"'^  (.^[rect— cos  (Px-Q-ut)  ]  9[p(Xq)  ]  )  (13) 

where  c?  indicates  the  Fourier  transform.  Using  the  expressions  for  the 

Fourier  transforms  given  in  Equations  (7)  and  (11)  leads  to  the  integral 
expressions 


e-J“^-^sin  ^  lect(kd^)  e^^dk  (14a) 


(ic+P) 
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(14b) 


1  1 

■a; 


The  integrals  over  k  in  Equation  (14b)  can  be  simplified  by  defining 
u  =  (k-P)  in  the  first  integral,  and  u  =  (k+p)  in  the  second  integral.  Then, 
for  both  integrals,  du  =  dk,  and  the  limits  of  integration  become  (-l/dj-p) 
and  (l/dp-p)  for  the  first  integral,  and(-l/do+P)  and  (1/do+p)  for  the  second 
integral.  The  limits  of  integration  on  the  two  integrals  can  be  made  the 
same;  let  u  -  -u  in  the  first  integral,  providing  the  same  integration  limits 
as  in  the  second  integral  with  only  a  change  in  the  sign  of  the  argument  of  ’ 
the  exponential.  Hence,  the  two  integrals  can  be  combined,  with  the  sum  of 
the  exponentials  of  the  two  parts  combining  via  the  identity  2*cos(2)  = 
(exp(j2)  +  exp(-jz)l.  So,  Equation  (14b)  becomes 


h{x,t)=—  j  sin-^-H^cos  (Px-ux-(i)t)  du  (15) 

Expanding  the  cosine  term  via  the  trigonometric  angle  sum  rule  changes 
Equation  (15)  to 


h[x,  t)  =  — cos  (ut) 

TZ 


/ 

"k* 


sin(ua) 


cos [x(p-u) ] du 


(16a) 


+  — sin  (ot) 

7t 


j  sin-lH£l.sin [x(p-u)  ]  du, 


(16b) 


which  has  the  form 


h{x,  t)  =A(x,  u)  cos  (ut)  +fl(x,  w)  sin(ut)  [o  t-arctan-|]  .  (17) 
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Hence,  the  effect  of  a  realistic,  finite  aperture  probe  beam  is  found  from  the 
factor 


sin(Px)  j"  (xu)  du-cos  (Px)  j 


sin(au)  „  ■  ,  > 

- i — i-sin  (xu)  du 


$  (x,  w)  =atan[- 


sin(px)  j  sinUu)  sin(xu)  du+cos  (Px)  j  cos  (xu)  du 


1 

■5’ 


sin{au) 


■■a;’ 


(18) 

Figure  5  shows  a  comparison  of  ${x,tJ)  versus  u  for  several  closely 
spaced,  fixed  values  of  x,  for  realistic  laser  frequency  and  beamwidth,  AO 
cell  and  lens  parameters  (see  Table  I).  The  results  of  the  calculations 
displayed  in  Figure  5  are  comparable  to  the  data  of  Figure  3. 

Figure  6  shows  a  comparison  of  9{x,u>)  versus  o)  for  two  largely  separated 
values  of  x,  and  should  be  compared  with  Figure  4.  In  Figure  4,  time  delay  is 
the  measured  value  of  the  slope  of  the  $  versus  o  lines  (ignoring  the  small 
oscillation  behavior),  and  is  equal  to  1.67  x  10"’  seconds  for  the  maximum 
slope  curve.  The  minimum  slope  curve  in  Figure  4  is  assumed  to  have  zero 
slope.  The  predicted  value  for  the  maximum  slope  curve,  assuming  a  full  2° 
turn  in  the  rotating  mirror  relative  to  its  initial  position  and  using  the 
expression  AT  =  2A9„F/v,  is  1.74  x  10"^  seconds,  with  the  values  F  =  0.5  inches 
and  V  =  5.1  km/sec  (Indium  phosphide  AO  transducer)  used  in  the  experiment 
[11].  Since  AT  is  also  given  by  AT  =  Ax/v,  the  expected  value  of  Ax  would  be 
8.9  X  10'’m.  Figure  6,  which  shows  remarkable  similarity  to  the  data  of 
Figure  4,  was  calculated  for  x,  =  0.01  x  10'*m  and  x,  =  10.41  x  10'’m,  ie,  for 
Ax  =  10.3  X  lO'^m,  a  deviation  of  only  16%  from  the  experimental  value.  This 
is  a  favorable  comparison,  considering  the  accuracy  with  which  the  mirror 
deflection  in  the  experiment  was  measured  is. 

It  was  mentioned  in  Section  II  that  the  oscillations  in  the  data  were 
found  to  be  relatively  insensitive  to  changes  in  laser  and  optical  parameters 
[10].  This  behavior  was  also  reproduced  by  the  calculation  of  $(x,(i)) 
presented  here.  The  key  parameters  all  enter  into  the  single  variable  d^  = 
AF/d,  the  Rayleigh  width  of  the  probe  beam,  hence,  if  one  key  parameter  is 
systematically  varied,  the  analogous  effect  of  varying  any  of  the  other 
parameters  is  immediately  apparent. 
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The  parameter  which  was  varied  in  this  work  was  the  reference  beam  width 
d.  As  d  becomes  infinitely  large,  the  probe  beam  diameter  (which  can  be 
viewed  as  the  focussed  reference  beam)  becomes  infinitesimally  small.  The 
reference  beam  width  was  varied  from  a  minimum  of  0.0012  m  to  a  maximum  of 
0.01  m,  over  a  factor  of  eight  variation.  The  results  of  this  variation  can 
be  seen  in  Figures  (7a,b,c,d),  in  which  d  =  0.0012m,  0.0024m,  0.0036m,  and 
0.01m,  respectively.  Notice  that  for  the  initial  factor  of  three  increase  in 
beam  diameter,  as  seen  in  Figures  (7a)  -  (7c),  the  change  in  *  is  negligible. 
Only  in  Figure  (7d)  is  the  approach  of  ®  to  a  purely  linear  behavior  seen,  as 
expected  for  an  infinite  plane  wave  focussed  to  a  point-like  probe.  When  d  = 
0.01m,  the  reference  beamwidth,  before  focussing,  is  over  four  times  that  of 
the  signal  beam.  , 

Note  that  the  minimal  effect  of  varying  reference  beamwidth  over  a  large 
range  of  values  could  also  be  claimed  as  the  result  of  maintaining  d  at  its 
original  value  and  varying  F  or  A.  The  focal  length  F  could  be  viewed  as 
changing  by  a  factor  of  two  increase  (up  to  F  =  25.4mm),  or  down  in  size  by  66 
percent  without  changing  the  appearance  of  the  nonlinear  oscillations  in  4.  A 
reduction  in  F  to  over  25  percent  of  its  original  value,  ie,  from  12.7  mm  to 
3.0  mm)  is  required  to  produce  the  approach  to  linear  behavior  seen  in  Figure 
(7d).  Laser  wavelength  could  be  viewed  as  ranging  from  0.7  microns  to  2.12 
microns  without  showing  a  change  in  the  small  oscillations. 


IV.  EFFECTS  OF  A  FINITE-APERTURE  PHOTODETECTOR 


Although,  there  will  be  diffraction  effects  in  the  output  signal  of  the 
photonic  delay  line  due  to  the  finite  aperture  of  the  photodetector,  it  can  be 
shown  that  these  effects  are  constant  with  respect  to  frequency,  once  x  is 
fixed.  Hence,  these  effects  would  be  scaled  out  of  the  data  on  4(x,u)  versus 
u  once  the  mirror  position  is  fixed,  and  so  would  not  be  useful  in  explaining 
the  data  of  Figures  3  and  4. 

To  understand  the  conclusion  presented  above,  consider  the  full 
expression  for  the  intensity  of  the  time-varying  signal  incident  on  the 
photodetector  surface,  Equation  3.  This  signal  (in  which,  for  clarity  here, 
the  effects  of  a  finite  probe  beam  have  not  been  included),  must  be  integrated 
over  the  width  of  the  photodetector  aperture,  Xp  [10].  (It  is  natural  for 
this  system,  in  which  a  one-dimensional  "fan"  of  optical  beams  leaves  the  AO 
cell,  to  consider  only  the  x-dimension  throughout).  Hence, 
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X, 


(19) 


Is(t)  ~A,fAs  J  COS  l2n  (f^+ft)  -—j^+i)]dXp, 

where  f  =  (Af  +  f„)  as  before.  Expanding  the  cosine  so  as  to  separate  out  the 
time  variation  of  the  argument  yields 


7CX  ^  2 

J^{  t)  ~A^^[cos  (27t/t)  j*  COS  (27cfj^-^^^+<|>)  dXp-sin  (2iift)  J  sin(27ifjjX--^^i-+(J))  dXp] 

0  0 

(20) 

which  can  be  rearranged  in  the  form  Is(t) -Ijocos  (27ift-$) ,  where 


^SO^^R^S, 


N 


(  j  cos(2nf^- 


7tXp‘ 

XF 


+4>)cix  ) 

p 


(  j  sin  (27ifjjX--T^+4))  dx  ) 


and 


r  Ttx  2 

j  sin(2jtf^--^^+(j))  dx 

4»=atan[-^ - -]  .  (22) 

/TlX  ^ 

cos  (27lfyX-  +(j))  dx 
0  P 

From  Equation  (22),  it  is  clear  that,  once  the  probe  location  in  the  AO 
cell  (ie,  X)  is  fixed,  variations  in  the  RF  frequency  driving  the  AO  cell  (ie, 
f)  will  not  affect  the  value  of  $.  Hence,  the  effect  of  the  photodetector 
aperture  on  the  phase  output  of  the  system  is  only  to  add  an  overall  constant 
to  for  a  fixed  value  of  x.  This  effect  would  not,  then,  be  relevant  to  the 
data  taken  in  References  3  and  7,  displayed  in  Figures  3  and  4. 

V.  CONCLUSIONS  AND  RECOMMENDATIONS 

In  this  paper,  it  has  been  shown  that  in  applications  of  the  photonic 
delay  line,  such  as  a  beamformer  for  wideband  phased  array  radar  systems,  care 
must  be  taken  due  to  the  effects  of  finite  apertures  in  the  optical  system. 
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The  linear  dependence  of  output  phase  on  the  signal  frequency  driving  the  AO 
cell  is  only  approximate.  The  finite,  rather  than  point-like,  spot  size  of 
the  probe  in  the  AO  cell  due  to  diffraction  of  the  optical  beam  introduces 
non-negligible  oscillations  in  the  read-out  phase.  Additional  effects  on 
are  introduced  due  to  the  finite  aperture  of  the  photodetector  in  the  optical 
system. 

In  order  to  evaluate  the  photonic  delay  line  for  use  in  a  radar  system, 
the  real  behavior  of  the  output  phase  as  a  function  of  RF  frequency  must  be 
considered.  The  oscillations  in  output  phase  may  have  a  major  role  in 
evaluating  the  impact  of  laser  mode  hopping  on  the  radar  system.  The  pointing 
error  of  a  shifting  laser  beam  will  be  compounded  by,  in  essence,  the 
existence  of  a  range  of  values  for  ®  in  each  frequency  bin.  Detailed 
calculations  are  required  in  order  to  decide  whether  care  in  constructing  the 
optical  system  can  overcome  the  added  hurdle  due  to  the  aperture  effects 
established  here.  Radar  system  beam  squint  and  pointing  accuracy  are  at  risk. 
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Figure  7: 


Realistic  parameters  used  in  calculations  of  ${x,u).  Parameters 
correspond  to  data  in  Reference  7. 

FIGURES 


Photonic  true  time  delay  phase  shifter  for  high-frequency 
applications:  optical  layout. 

Delay  as  a  function  of  position  of  the  optical  probe  beam  along 
the  acoustic  waveform  in  the  Bragg  cell. 

versus  f  data  from  low-frequency  (55  -  80  MHz)  photonic  delay 
line  from  Reference  3. 

♦  versus  f  data  from  high-frequency  (1.29  -  1.31  GHz)  photonic 
delay  line  from  Reference  7. 

it  (labelled  "p")  versus  u  =  2Tcf,  calculated  for  realistic 
parameters  (see  Table  1),  for  small  changes  in  probe  location  x, 
where  each  curve  is  for  a  different  fixed  value  of  x:  solid  line: 

X  =  2.3mm;  dash-dot:  x  =  1.6mm;  dotted  line:  x  =  0.8mm;  dashed 
line;  x  =  0.1mm. 

•t  (labelled  "p")  versus  u  =  27if,  calculated  for  realistic 
parameters  (see  Table  1),  for  a  large  change  in  probe  location  x, 
where  the  dashed  curve  is  for  x  =  0.001mm,  and  the  solid  curve  is 
for  X  =  1.041mm.  Note  that  the  curve  of  maximum  slope  undergos 
several  2k  phase  shifts. 

9  (labelled  "p")  versus  u  =  27tf,  calculated  for  realistic 
parameters  (see  Table  1),  for  probe  location  x  =  0.1mm,  in  order 
to  illustrate  the  slow  dependence  of  il>  on  changes  in  system 
parameter  d,  for  (a)  d  =  1.2mm;  (b)  d  =  2.4mm;  (c)  d  =  3.6mm; 

(d)  d  =  10mm. 
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Table  1 


Symbol  Parameter 

A  Laser  Wavelength 

F  Lens  Focal  Length 

a  Signal  Beamwidth 

d  Reference  Beamwidth 

V  Sound  Speed  in  AO  Cell 

Xpmax  Photodetector  Width 


Value 

1.06  p 

12.7  mm 
2.4  mm 
2.4  mm 
5120  m/s 
0.015  mm 
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AN  INTERFACE  INTEGRATED  CIRCUIT 
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Abstract 

An  interface  Integrated  Circuit  (IC)  was  partially  designed 
using  VHDL.  The  IC  worked  within  an  •'element”,  an  element 
containing  two  memory  ICs  and  two  processor  ICs.  This  IC 
interfaced  an  element  to  an  external  host  processor  via  a 
futurebus,  a  data  bus  from  external  sensors  and  a  bus  internal  to 
the  element.  Elements  are  packaged  within  a  Multi-Chip  Module 
(MCM)  containing  either  four  or  sixteen  elements.  The  system 
supports  boundary  scan  testing. 

The  MCMs  are  a  digital  signal  processing  system  to  be  used  for 
fire  control,  space,  Kalman  filtering  and  fast  Fourier  transforms 
applications  [1]. 
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AN  INTERFACE  INTEGRATED  CIRCUIT  FOR  A 


DIGITAL  SIGNAL  PROCESSING  MULTI  CHIP  MODULE 

Myron  E .  Sveim 

Introduction 

Digital  Signal  Processing  (DSP)  has  uses  in  fire  control, 
radar  antenna  sidelobe  suppression,  target  recognition,  and  space 
applications.  Fast  Fourier  Transforms  (FFT)  and  Kalman  filtering 
are  two  methods  used  in  these  applications,  which  require  fast 
multiplications  techniques.  These  techniques  are  often  implemented 
in  integrated  circuits  designed  around  fast  multiplication 
algorithms . 

Signals  from  radars  and  infra-red  sensors  often  range  widely 
in  amplitude.  In  the  system  described  here  this  problem  is 
attacked  by  using  floating  point  digital  signal  processors. 

In  real  systems  several  DSP  problems  can  be  computed 
simultaneously,  or  processed  in  parallel.  This  system  uses 
massively  parallel  processing  to  speed  the  throughput  [2]. 

The  Integrated  Circuit  (IC)  or  chip  discussed  in  this  paper  is 
to  interface  system  "elements”  to  a  common  futurebus. 
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Element 


An  element  contains  six  integrated  circuits  or  "chips. ”[2] 
See  Figure  1.  The  six  integrated  circuits  are  four  memory 
elements,  and  two  Floating  Point  Application  Specific  Processors 
(FPASP),  or  floating  point  digital  signal  p:  ocessors.  These  ICs 
are  interconnected  by  two  address  busses,  two  data  busses  and  a 
memory  control  bus.  The  ELement  InterFace  (ELIF)  chip  ties  these 
element  internal  buses  together  with  a  futurebus  and  a  data  bus. 

Boundary  scan  testing  is  supported  by  the  ELIF  and  FPASPs. 
The  Joint  Test  Action  Group  (JTAG)  first  created  this  test 
specification.  This  group  recognized  that  as  ICs  grew  larger  and 
more  complex,  special  testing  technigues  must  be  designed  and 
specified.  The  result  is  IEEE  specification  1149. 

Data  Bus 

Sensor  data  is  received  via  optical  fiber  and  receiver.  This 
data  link  and  "Hot  Rod"  receiver  uses  gallium  arsenide  (GaAs) 
technology  for  high  speed  data  rates.  The  "Hot  Rod"  data  link 
system  is  made  by  the  Tri-Quint  company.  The  sensor  data  rate  is 
40  Mbits/sec  over  a  serial  optical  fiber  link.  At  the  Hot  Rod 
receiver  the  serial  data  is  converted  to  a  parallel  40  bit  word. 
Only  32  bits  of  this  40  bit  word  are  used  by  the  ELIF.  The  32 
parallel  data  word  consists  of  two  16  bit  words,  each  word  being 
the  output  of  Analog  to  Digita  (A/D)  converters.  The  A/D  output 
words  are  not  required  to  be  16  bits,  but  can  be  less,  and  the 
unused  bits  will  be  ignored  by  the  FPASPs. 
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The  Hot  Rod  system  used  4B/5B  encoding  and  the  receiver  will  output 
an  error  bit  if  an  invalid  4B/5B  code  is  received.  A  34  bit  deep 
First  In  -  First  Out  (FIFO)  storage  register  is  used  to  prevent 
data  loss  while  the  ELIF  is  handling  futurebus  transactions. 

The  Hot  Rod  receiver  gets  a  40  MHz  clock  signal  from  the  ELIF. 
Futurebus 

Futurebus  specifications  are  contained  in  IEEE  ????.!.  The 
futurebus  connects  the  host  processor  to  either  four  or  sixteen 
elements.  See  Figure  3.  The  futurebus  address/data  bus  is  64 
bits  wide.  The  ELIF  has  a  three  deep  FIFO  data  register  for  the 
futurebus  address/data  bus  to  prevent  data  loss  and  increase 
throughput.  The  host  processor  is  the  centralized  arbiter  and 
therefore  distributed  arbitration  is  not  needed.  Table  1  shows  the 
Future  bus  signals. 
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AD*  -  64  bits  -  bidirectional  Address  &  Data  bus 

GR*  -  2  bits  -  ELIF  to  host  GRant  &  host  to  ELIF  GRant 

RQ*  -  2  bits  -  bidirectional  ReQuest  lines 

AS*  -  1  bit  -  bidirectional  Address  Strobe 

AK*  -  1  bit  -  bidirectional  Address  acKnowledge 

AI*  -  1  bit  -  bidirectional  Address  acknowledge  Inverse 

DS*  -  1  bit  -  bidirectional  Data  Strobe 

DK*  -  1  bit  -  bidirectional  Data  acKnowledge 

DI*  -  1  bit  -  bidirectional  Data  acknowledge  Inverse 

ET*  -  1  bit  -  bidirectional  End  of  Tenure 

GA  -  5  bits  -  output  -  geographical  address 

Futurebus  Lines  to  ELIF 
Table  1 


Internal  Buses 

The  element  internal  buses  (see  Figure  1)  are  four  data,  two 
address  busses  and  a  memory  control  bus.  These  busses  communicate 
between  the  ELIF,  FPASPs  and  the  memory  chips.  See  Table  2. 

The  ADDO  address  bus  services  the  memories  MEM_UO,  MEM_LO  and 
FPASPO,  while  address  bus  ADDl  handles  MEM_U1,  MEM_L1  and  FPASPl. 
Each  FPASP  communicates  with  its  respective  memories  as  well  as 
with  the  other  FPASP  via  the  four  data  busses.  Each  memory  chip 
has  three  lines  controlling  data  direction,  memory  select  and  data 
bus  tri-stating. 
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ADDO  -  ADDress  bus  0  to  FPASPO  and  memories  0. 


Bit  21  specifies  either  MEM_L0  or  MEM_U0. 

ADDl  -  ADDress  bus  1  to  FPASPl  and  memories  1. 

Bit  21  specifies  either  MEM_L1  or  MEM_U1. 

LDO  -  Lower  Data  bus  0-32  bit  used  by  memories  0. 

UDO  -  Upper  Data  bus  0-32  bit  used  by  memories  0. 

LDl  -  Lower  Data  bus  1-32  bit  used  by  memories  1. 

UDl  -  Upper  Data  bus  1-32  bit  used  by  memories  1. 

OE_bar  -  Output  Enable  -  Controls  tristating  of  memories. 
CS_bar  -  Chip  Select  -  selects  individual  memories. 

WE_bar  -  Write  Enable  -  selects  either  read  or  write  mode  for 

each  memory. 

Operate  -  the  ELIF  authorizes  each  FPASP  to  perform  it's 

program . 

BA  -  Bank  Authorization  -  Tells  each  FPASP  which  memory  bank 

it  has  authorization  to  access. 

WD  -  Watch  Dog  -  tells  if  either  FPASP  is  in  the  watchdog 

mode,  in  which  one  FPASP  checks  the 
other's  computations. 

DONE  -  done  -  FPASP  indicates  to  the  ELIF  that  the  FPASP  is 

finished  with  it's  computations, 
often  a  request  for  a  futurebus  service. 
BA  -  Bank  Authorization  -  indicates  to  FPASP  which  memory  bank 

it  has  access  to. 

Element  Buses  &  Control  Signals 
Table  2 
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Element  Interface  Integrated  Circuit  (ELIF^ 

The  ELIF  (see  figure  2)  interfaces  with  the  futurebus,  data 
bus  and  internal  element  busses,  and  does  some  housekeeping  duties. 

The  Clock  Generator  receives  a  40  MHz  clock  from  the  host,  and 
outputs  a  single  phase  clock  to  the  Hot  Rod  receiver,  and  a  two 
phase  clock  to  the  FPASPs.  It  also  receives  test  information  from 
the  JTAG  entity  and  outputs  Testmode  signals  to  the  FPASPs. 

The  JTAG  entity  enables  all  chips  within  the  Element  to  be 
tested  using  JTAG  standard  protocols. 

The  data  FIFO  receives  32  bit  parallel  data  word  from  the 
Triquint  Hot  Rod  receiver  and  puts  two  32  bit  words  into  a  64  bit 
wide  FIFO.  The  FIFO  is  used  to  prevent  data  loss  during  futurebus 
transactions.  The  first  data  word  contains  the  address  of  element 
memory  where  the  data  is  to  be  put.  This  is  stored  in  the  Data 
Address  Register.  The  Data  Address  Register  is  incremented  after 
each  data  transfer  to  Element  memory.  Occasionally  an  invalid  code 
is  received  by  the  Hot  Rod  receiver  and  the  Data  Error  bit  is 
asserted.  The  corrupt  data  addresses  are  stored  in  element  memory, 
and  this  is  controlled  by  the  Error  Address  Register. 

The  ELIF  futurebus  portion  gets  arbitration  signals  from  the 
central  arbiter,  which  is  the  host  computer.  Any  element  can 
request  access  to  the  address/data  bus,  but  the  central  arbiter 
decides  which  element  will  have  access.  The  element  status  is 
stored  in  the  futurebus  Status  Register,  and  is  available  to  the 
host  upon  request.  The  Geographical  Address,  or  physical  address 
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(physical  location)  of  an  element  is  how  the  host  differentiates 
between  elements.  A  reset  line  resets  the  ELIF  and  element  to 
their  initial  conditions. 

Multichip  Module 

Either  four  or  sixteen  elements  are  packaged  in  a  Multichip 
Module  (MCM).  A  MCM  is  several  ICs  interconnected  on  a  common 
supporting  substrate,  and  packaged  as  an  interconnected  group.  The 
substrate  conductors  often  are  formed  in  multiple  layers  separated 
by  insulating  material.  Vias  connect  various  layers.  Circuit 
densities  are  much  greater  than  with  previous  packaging 
techniques [ 3 ] . 

Figure  3  shows  a  four  element  MCM  electrical  block  diagram 
with  the  data  busses,  and  futurebus  connecting  the  host  to  the  MCM. 
Conclusion 

An  interface  IC  was  partially  designed  for  tying  three  busses 
together  using  VHDL  (Very  high  speed  integrated  circuit  Hardware 
Description  Language).  The  busses  were  a  data  bus,  a  futurebus 
connection  all  elements  and  a  host  computer,  and  a  bus  internal  to 
each  element.  Regretfully  there  was  not  sufficient  time  this  last 
summer  to  complete  the  project. 
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ELEMENT  BLOCK  DIAGRAM 
Figure  1 
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ELEMENT  INTERFACE  (ELIF)  INTEGRATED  CIRCUIT 

Figure  2 
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Abstract 


Optical  OR  and  NOR  gates  were  developed  using  Q-switched  GaAs/AlGaAs 
diode  lasers  driven  directly  by  photo  detectors.  The  Q-switched  lasers 
were  designed  and  fabricated.  Both  optical  logic  functions  were 
demonstrated.  In  the  OR  gate,  an  on-off  ratio  of  -1000:1  was  observed 
using  a  multi-mode  Q-switched  laser,  and  an  on-off  ratio  better  than  10:1 
was  observed  for  single  mode  Q-switched  lasers .  The  NOR  gate  on-off  ratio 
was  significantly  less  for  the  multi-mode  Q-switched  lasers.  The  power 
required  to  switch  the  single  mode  Q-switched  lasers  was  less  than  one 
twentieth  the  lasers'  output  power. 
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CASCADABLE  OPTICAL  LOGIC  GATES 


BASED  ON 

DETECTOR  DRIVEN  Q-SWITCHED  LASERS 

Paul  D.  Swanson,  M.A.  Parker,  S.I.  Libby,  J.S.  Kiiranet 
Introduction 

Logic  gates  with  coherent  optical  inputs  and  outputs  are  the  basic 
building  blocks  for  optical  switching  and  signal  processing.  Q-switched 
semiconductor  lasers  provide  the  ideal  output  for  such  gates  due  to  their 
high  on-off  ratios  and  abrupt  switching  characteristics.^  Previous  efforts 
have  developed  NOR  gates  using  high  intensity  input  beams  to  quench  the 
gain  section  within  the  laser  cavity,  thus  Q-switching  it  off.^ 
Unfortunately  such  devices  cannot  by  themselves  be  cascaded  since  the  input 
intensity  must  be  larger  than  the  lasing  intra-cavity  light,  and  the  output 
intensity  will  always  be  smaller  than  the  lasing  intra-cavity  light.  OR 
gates  that  use  input  beams  to  saturate  an  intra-cavity  absorption  section 
suffer  from  the  same  flaw.  The  research  presented  in  this  paper 
demonstrates  a  means  of  Q-switching  a  laser  diode  with  significantly  less 
power  than  the  laser  diode's  output.  This  is  accoitplished  by  voltaicly 
changing  the  absorption  within  a  very  small  modulator  section  of  the  laser 
cavity.  The  voltage  swing  required  to  turn  off  and  on  the  laser  can  be 
easily  provided  by  the  photo-voltaic  response  of  a  photo  detector.  Both  OR 
and  NOR  functionality  can  be  achieved  by  varying  the  orientation  and 
biasing  of  the  detector (s)  . 
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Methodolocrv 


The  Q-switched  lasers  used  a  graded  index,  multiple  guant\im  well 
laser  diode  heterostructure.  Such  material  can  be  used  as  an  optical 
amplifier,  electro-absorptive  modulator,  or  photo-detector,  depending  on 
how  the  diode  is  biased.^ A  forward-bias  current  produces  gain,  while 
reversing  the  bias  increases  both  the  optical  absorption  and  the  resulting 
photocurrent.  The  Q-switched  laser  optical  cavity  consisted  of  a  long  gain 
section  and  a  very  short  modulator  section. 

Figure  1  shows  the  lasing  threshold  current  of  the  gain  section  as  a 
function  of  modulator  bias  for  a  laser  consisting  of  a  200  |iM  by  20  gain 
section  and  a  20  flM  by  20  [IM  modulator. 


MODULATOR  VOLTAGE 


Figure  1.  Threshold  current  of  Q-switched  laser  as  a  function  of  the 
intra-cavity  modulator  voltage. 
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These  data  show  that  the  laser  can  be  turned  off  by  reverse-biasing  the 


modulator,  even  when  the  gain  section  is  biased  significantly  above  the 
threshold  current  required  for  lasing  when  the  modulator  is  not  reverse- 
biased.  The  amount  of  current  required  to  maintain  reverse  bias  on  the 
modulator  is  limited  to  the  photocurrent  caused  by  spontaneous  emission 
since  stimulated  emission  in  the  gain  section  ceases  when  the  laser  is 
turned  off.  When  the  laser  is  turned  on,  the  modulator  photo  current  is 
still  negligible  due  to  the  modulator's  lack  of  absorption  in  forward  bias. 

Figure  2  shows  the  laser/detector  circuit  design  for  the 
implementation  of  the  OR  function.  The  current  source  biasing  the 
modulator  and  detectors  is  designed  to  reverse-bias  the  modulator  in  the 
absence  of  detector  photocurrent.  Thus  the  sourced  current  should  be  set 
equal  to  the  spontaneous  emission  photocurrent  of  the  modulator  when  the 
laser  is  in  the  off  state.  When  the  input  detectors'  photocurrent  becomes 
sufficient  to  satisfy  the  source  current,  the  modulator  voltage  will  swing 
positive  and  the  laser  will  turn  on.  When  input  light  is  terminated,  the 
modulator  voltage  will  swing  negative  and  the  laser  will  turn  off. 


Figure  2.  Schematic  for  the  OR  gate. 
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Figure  3  shows  the  laser/detector  circuit  required  to  achieve  NOR 
functionality.  In  this  design,  the  modulator  voltage  remains  positive  in 
the  absence  of  input  light.  When  the  photocurrent  of  the  detectors  becomes 
larger  than  that  of  the  current  source,  the  modulator  current  will  reverse 
direction,  causing  a  negative  modulator  voltage  swing,  turning  the  laser 
off.  Required  switching  power  should  decrease  with  smaller  sourced 
currents,  while  the  laser  output  power  should  increase  with  larger  forward- 
bias  modulator  currents.  For  this  reason,  maximum  gain  conditions  should 
occur  at  a  critical  biasing  current. 


- • — 
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Figure  3.  Schematic  of  the  NOR  gate. 

The  detectors  for  either  design  can  be  either  external  discrete  photo 
detectors,  or  integrated  in  waveguides  on  the  same  substrate  as  the  Q- 
switched  laser. 


Fabrication 

A  variety  of  two  section  etched  ridge  lasers  with  and  without 
monolithically  integrated  waveguides  was  fabricated  from  a  standard 
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multiple  cpaantum  well  laser  heterostructure,  shown  in  Figure  4.  The 
heterostructure  consists  of  quantum  wells  centered  in  a  p-i-n  diode 
superimposed  on  a  planar  optical  waveguide  (high-index  core  surrounded  by  a 
low-index  cladding) . 


Aluminum  Content  (x) 
0.5  0 


Bottom  Cladding 


0.25|im  GaAs  p  = 

:  2xl0‘’ 

1.5pm  Al^Gaj.^As 

p  =  8xl0” 

0.2um  AlxGa,.xAs 

undopcd 

5  ea.  of  Ga  80  A  ca. 

0.2um  .AIxGai.xAs 

undopcd 

1.5pm  Al^Ga  ,(As 

n  =  5xl0” 

GaAs  Substrate 

n  =  2.xl0’’ 

Figure  4.  A  standard  epitaxially-grown  laser  heterostructure,  utilizing 
multiple  quantum  wells  in  an  active  region  centered  within  a 
graded- index  waveguide. 


The  lasers  require  etched  regions  of  two  different  depths:  a  deep 

etch  that  extends  well  into  the  lower  cladding  region  for  the  laser 
mirrors,  and  a  shallow  etch  to  delineate  the  waveguides.  In  order  to 
create  an  effective  lateral  index  difference,  the  shallow  etch  should  be 
within  100  nm  of  the  top  of  the  planar  waveguide  core.®  Mirror-quality 
vertical  walls  were  etched  using  chemically  assisted  ion  beam  etching 
(CAIBE) J  Chrome,  silicon  dioxide  or  photo-resist  all  can  be  used  as  CAIBE 
etch  masking  materials;  however,  of  the  three,  only  chrome  stands  up  well 
enough  to  produce  good  quality  laser  mirrors.  Originally  chrome  and 
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silicon  dioxide  were  used  to  mask  the  two  different  etch  depths,  but  due  to 


charging  problems  in  the  CAIBE,  the  silicon  dioxide  etch  mask  was  replaced 
with  photo-resist.  The  chrome  doiibled  as  the  top  metal  for  the  p-type 
ohmic  contact,  so  it  did  not  require  removal  after  processing. 

The  devices  were  etched  twice  in  the  CAIBE.  In  the  first  etch  only 
the  regions  to  be  deep  etched  were  not  masked.  The  depth  of  this  etch  was 
that  of  the  difference  of  the  two  desired  etch  depths.  After  the  silicon 
dioxide  (or  photo-resist)  mask  was  removed,  leaving  only  the  chrome  mask,  a 
second  etch  was  performed.  The  depth  of  the  second  etch  was  that  of  the 
desired  shallow  etch. 

After  the  two  CAIBE  etches,  oxygen  was  implanted  at  60  KeV,  120  KeV, 
and  180  KeV  at  a  dose  of  10^^  /cm^  for  electrical  isolation  between  laser 
sections.  The  p-contact  metallization  (20  nm  of  titanium,  20  nm  of 
platinum,  300  nm  of  gold  and  170  nm  of  chrome)  was  used  to  mask  the  oxygen 
implant.  Finally,  the  wafers  were  lapped  down  to  250  |iM  thick,  and  the 
backs  were  metallized  with  10  nm  of  nickel,  40  nm  of  germanium,  80  nm  of 
gold,  100  nm  of  silver  and  70  nm  of  gold  for  the  n-type  ohmic  contact.  The 
contacts  were  alloyed  at  360°C  for  one  minute.  The  wafers  were  then 
cleaved,  mounted  and  tested. 

Figure  5  shows  cross  sections  and  top  views  of  the  processing  of  a 
pair  of  two-section  lasers.  Cleaving  down  the  center  of  the  finished  wafer 
yields  two  lasers ,  each  with  one  cleaved  facet  mirror  and  one  total 
internal  reflection  mirror  (TIR) .  The  laser  output  would  exit  the  cleaved 
facet  mirror,  on  the  modulator  side  of  the  laser. 
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Figure  5.  Cross  sections  and  top  views  of  a  pair  of  two-pad  lasers  during 
progressive  stages  in  fabrication:  (a)  After  depositing  and 
patterning  the  Si02;  (b)  After  the  patterning  of  the  ohmic 
contact  /  oxygen  implant  mask  /  chrome  CAIBE  mask;  (c)  After 
both  CAIBE  etches  and  the  oxygen  implant. 
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Testing 


The  devices  were  tested  using  a  fiber  optic  /  electronic  probe 
station  at  the  Photonics  Laboratory  at  Griffiss  Air  Force  Base.  A  pulse 
generator,  used  as  a  clock  for  the  gates,  pulsed  the  gain  sections  of  the 
lasers  while  discrete  PIN  detectors  were  connected  as  shown  in  Figures  2 
and  3  to  implement  the  OR  and  NOR  gate,  respectively.  A  low-power  helium- 
neon  laser  was  used  as  optical  input  beams.  Because  of  the  design  of  these 
gates,  the  input  light  wavelength  does  not  affect  the  gates'  operation 
(within  the  limits  of  the  sensitivity  of  the  detectors  used) .  Though 
fabricated,  the  gates  with  monolithically  integrated  wave  detectors  were 
not  ready  to  be  tested  by  the  conclusion  of  the  research  program. 
Therefore,  only  the  discrete  tests  are  presented  here. 

Results 

Two  sizes  of  Q-switched  lasers  were  used  to  test  the  gates.  The 
first  consisted  of  a  200  fiM  by  20  gain  section  and  a  20  fiM  by  20  |JM 
modulator,  separated  by  a  2  |iM  shallow  etched  (and  oxygen  implanted)  gap. 
The  second  consisted  of  a  200  |JM  by  4  |iM  gain  region  and  a  4  fiM  by  4  pM 
modulator,  again  separated  by  a  2  HM  etched  gap.  Before  the  oxygen 
implant,  the  isolation  trench  between  the  gain  section  and  the  modulator 
provided  resistances  varying  between  300  and  1000  ohms.  After  the  inplant, 
the  pad-to-pad  resistance  of  the  20  [iM  wide  devices  was  -1.7  Meg  ohms, 
while  the  4  |JM  wide  pad-to-pad  resistance  was  even  higher.  This  resistance 
was  crucial  for  the  performance  of  the  gates,  since  the  currents 
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determining  the  bias  on  the  modulator  were  orders  of  magnitude  smaller  than 
the  pump  currents  driving  the  gain  section. 

OR  Gate 

The  OR  function  was  demonstrated  by  directing  the  helium-neon  laser 
onto  two  PIN  detectors,  which  were  connected  to  a  Q-switched  laser  as  shown 
in  Figure  2.  Both  20  |iM  wide  and  4  |XM  wide  Q-switched  lasers  successfully 
performed  the  OR  function,  with  the  multi  mode  (20  |iM)  lasers  yielding  a 
higher  on-off  ratio  and  the  single  mode  (4  |iM)  lasers  yielding  higher  gain. 
For  each  case,  the  current  source  was  set  to  the  photocurrent  of  the 
modulator  section  when  its  bias  voltage  was  approximately  -4  volts.  This 
current  was  0.7  mA  for  the  20  |iM  square  modulator,  and  0.002  mA  for  the  4 
[IM  square  modulator.  For  the  latter  case,  an  input  power  of  3.95  HW 
switched  on  the  gate,  producing  83  |i.W  of  output  power,  and  thus  providing  a 
power  gain  of  21.  The  20  pM  wide  Q-switched  lasers  did  provide  a  gain 
above  unity. 

Figure  6  shows  the  emission  spectrum  for  an  OR  gate  using  a  20  |iM 
wide  Q-switched  laser.  Both  the  logic  1  output  state  and  the  logic  0 
output  state  are  shown.  From  this  data,  the  on/off  ratio  is  determined  to 
be  approximately  1000:1.  The  4  wide  lasers  yielded  similar  results  with 
an  on/off  ratio  of  better  than  10:1. 
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WAVELENGTH  (NM) 


Figure  6.  The  emission  spectrum  of  an  OR  gate  using  a  20  micron  wide  Q- 
switched  laser  for  both  the  logic  0  and  1  output  states. 

NOR  Gate 

The  NOR  function  was  successfully  demonstrated  by  connecting  the  Q- 
switched  laser  and  PIN  detectors  as  shown  in  Figure  3.  The  negative 
voltage  bias  on  the  detectors  was  set  at  -3  volts.  As  anticipated, 
increasing  the  sourced  current  to  the  modulator  increased  the  output  and 
switching  power;  however,  the  switching  power  increased  at  a  much  faster 
rate  than  the  output  power.  For  this  reason  a  zero-mA  source  current 
yielded  the  best  gain.  Figure  7  shows  the  emission  spectrum  of  a  NOR  gate 
using  a  20  |IM  wide  Q-switched  laser.  Both  the  output  logic  1  and  logic  0 
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state  are  shown  for  the  current  source  set  to  zero  mA. 


Note  that  at  these 


bias  levels  the  on/off  ratio  is  only  3.4:1. 


WAVELENGTH  (MICRONS) 

Figure  7.  The  emission  spectrum  of  a  NOR  gate  using  a  20  pM  wide  Q- 


switched  laser  for  both  the  logic  0  and  logic  1  output  states. 


Conclusion 

Both  the  OR  gate  design  and  the  NOR  gate  design  were  validated, 
using  discrete  detectors  and  various-sized  Q-switched  lasers.  Only  gates 
using  single  mode  lasers  with  very  small  area  modulators  demonstrated  gains 
capable  of  allowing  the  devices  to  be  cascaded  to  the  input  of  a  similar 
gate.  Fully  integrated  single  mode  Q-switched  lasers  and  detectors  were 
fabricated;  however,  time  constraints  prevented  test  results  for  these 
devices  from  being  presented  in  this  report. 
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ABSIRACI 

This  paper  introduces  new  concepts  in  query  processing  needed  to  support  construction 
of  models  from  existing  component  submodels  which  are  stored  in  a  distributed  model 
database  management  system  (MDBMS).  Dynamic  execution  of  submodels  provides  the 
basis  for  evaluation  of  global  model  constraints  specified  as  part  of  the  model  database 
query  or  global  model  specification.  The  paper  describes  a  network  service  approach  to 
modeling  and  simulation  which  was  developed  to  support  the  evolution  of  standard 
interfaces  and  component  model  implementations  required  by  a  MDBMS.  The  MDBMS 
research  direction  presented  in  this  paper  is  part  of  a  strategy  that  the  Modeling  and 
Simulation  Office  at  Rome  Laboratory  is  pursuing  to  address  the  problems  of  model 
interoperability,  reusability  and  construction  of  models  that  operate  with  varying 
degrees  of  fidelity. 
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MASTERMIND:  Modeling  and  Simulation 
Technology  Enhancement  for  Research  in 
Multimedia  Inteliigent  Network  Domains 


Scott  E.  Spetka 

1 )  Introduction  -  Evolution  of  Standards 

The  problems  of  interoperability,  reusability,  and  fidelity  are  related  by  their  potential 
solution  through  dictation  of  standards  for  implementation  of  model  component 
interfaces.  There  is  a  simple  reason  why  this  approach  is  inappropriate  at  this  time. 
Modeling  and  simulation  technology,  especially  in  the  area  of  object-oriented  modeling  is 
still  under  development.  The  areas  of  model  interaction  and  hierarchical  modeling  for 
varying  fidelity  are  currently  the  subject  of  research  in  a  variety  of  diverse  problem 
domains.  Much  of  the  current  research  involves  artificial  intelligence,  distributed 
processing  and  multimedia.  The  approach  of  modeling  and  simulation  technology 
enhancement  for  research  in  a  multimedia  intelligent  network  domain  (MASTERMIND) 
is  to  enhance  existing  and  developing  modeling  and  simulation  technology  in  order  to 
provide  an  environment  in  which  the  modeling  and  simulation  community  can  evolve 
standards  which  facilitate  building  new  models  from  a  distributed  database  of  existing 
models. 

In  section  2,  we  briefly  consider  the  parameters  that  govern  existing  modeling 
environments.  We  also  consider  environments  that  are  currently  under  development  in 
which  standards  will  evolve.  The  MASTERMIND  environment  is  described  in  section  3.  It 
is  based  on  expected  advances  in  the  technology  described  in  section  2.  Section  4 
introduces  the  concept  of  a  Community  Research  Network  Resource  which  is  an  integral 
component  of  a  plan  to  facilitate  the  evolution  of  standards  for  modeling  and  simulation 
and  to  provide  additional  benefits  from  sharing  of  technical  ideas  at  all  levels.  The 
concluding  section  discusses  directions  for  future  research. 
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2)  Current  Modeling  Environments 


Modeling  environments  have  developed  over  the  past  25  years  in  which  some  networking 
is  used  and  in  which  object-oriented  programming  techniques  are  applied.  Most  of  these 
systems  have  been  developed  in  the  past  5  to  1 0  years.  The  growth  of  distributed 
modeling  environments  has  paralleled  the  growth  of  local  area  networks  and  the 
development  of  smaller  and  more  powerful  processors.  This  trend  continues,  with  new 
modeling  and  simulation  technology,  such  as  object-oriented  modeling  being  driven  by 
the  increasing  availability  of  processing  resources  growing  up  around  a  maturing 
national  digital  network. 

The  technologies  that  will  drive  new  developments  in  modeling  and  simulation  are  new 
and  faster  processors,  improved  networking  and  advances  in  object-oriented  systems.  It 
is  important  in  any  research  program  aimed  at  system  development  in  a  broad  area  of 
application  that  the  target  for  design  be  an  architecture  that  is  expected  to  be  affordable 
upon  completion  of  the  project  so  that  software  developed  that  can  take  advantage  of 
expected  gains  in  hardware.  Research  projects  that  build  software  for  today's  expensive 
workstations  help  to  drive  down  the  cost  of  hardware  and  make  it  affordable,  by 
increasing  demand.  The  diagram  below  shows  integrated  modeling  environments,  loosely 
coupled  through  the  Internet. 


2. 1 )  The  Internet 

The  potential  of  increased  access  to  processing  resources  through  public  and  private 
networks  will  be  a  critical  factor  in  modeling  and  simulation  technology  development. 
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Although  some  systems  currently  use  networks  to  implement  distributed  processing,  the 
interfaces  to  distributed  function  interfaces  are  often  proprietary,  special  purpose,  or 
subject  to  change.  They  have  not  been  used  to  provide  access  to  the  functions  that  exist.  It 
is  safe  to  say  that  even  within  companies,  government  research  laboratories,  the  degree 
of  sharing  and  reuse  of  model  components  has  been  disappointing,  given  the  availability 
of  the  Internet  over  the  past  1 0  years. 

Much  of  the  problem  has  been  associated  with  the  monolithic  nature  of  many  systems 
developed  over  the  last  1 0  years.  Systems  were  not  constructed  to  provide  convenient 
interfaces  that  could  be  generalized  to  provide  access  to  component  models  in  a  complex 
environment  for  use  in  the  construction  of  new  models.  The  development  of  object- 
oriented  techniques  for  system  design  has  lead  to  significant  improvement  in  this  area. 

The  lack  of  sufficient  bandwidth  in  the  Internet  also  prohibited  the  degree  of  sharing  that 
might  otherwise  have  been  possible  over  this  period.  Distributed  models  and  simulations 
that  were  developed  relied  on  high-speed  local  area  networks  for  service.  Increasing 
bandwidth  in  the  Internet  along  with  object-oriented  technology  are  expected  to  solve 
many  of  the  technical  problems  preventing  distributed  modeling  in  a  truly  interoperable 
environment. 

Communication  within  the  modeling  and  simulation  community  must  improve  for 
increased  sharing  and  development  of  object-oriented  models  that  are  useful  to  the  point 
that  developers  may  reuse  them  in  their  models.  Services  available  on  the  Internet 
provide  simple  file  retrieval  and  are  used  to  share  access  to  every  form  of  software 
available.  Technical  articles,  bitmaps,  system  source  code  and  binary  applications,  and 
more  are  routinely  distributed  through  the  Internet  today.  These  services  can  be 
efficiently  provided  using  database  indexing  techniques  often  depend  on  machines  which 
are  financed  by  government  programs.  As  object-oriented  systems  are  developed,  their 
component  models  can  be  distributed  through  the  existing  mechanisms.  Applications  that 
can  be  used  for  search  and  retrieval  on  the  Internet  include  Gopher,  Archie,  World- 
Wlde-Web,  and  Hytelnet  [Kehoe  92]  [Marine  93]. 

2.2)  Modern  Workstations 

Modern  Workstations  are  driving  the  implementation  of  models  and  simulations  that 
would  not  have  been  possible  1 0  years  ago.  Application  are  being  developed  that  depend 
on  time  consuming  access  to  knowledge  bases  and  the  application  of  other  processor- 
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intensive  techniques  from  the  field  of  artificial  intelligence.  These  approaches  are 
possible  due  to  the  introduction  of  a  new  generation  of  micro-  and  mini-computers.  The 
DEC  Alpha  workstation,  for  example  is  a  64  bit  machine  that  runs  at  200  MHz.  Several 
models  are  available  at  under  $1 5,000.  Fully  networked  UNIX  systems  are  also 
available  for  Intel  80386/486  based  systems.  A  33  MHz  machine  costs  around  $1,500. 

Advances  in  hardware  and  software  are  adding  to  the  quality  of  models  while  supporting 
improved  communication  within  the  modeling  and  simulation  community.  Multimedia 
cards  are  inexpensive  and  color  graphics  is  also  affordable  for  most  users.  Software 
products,  like  DEC's  Mediaimpact  and  Adobe  Premier  provide  easy  interfaces  to  include 
live  video  or  computer  stored  video  recordings  into  models.  Multimedia  hardware  can  be 
used,  along  with  the  Internet,  to  transmit  video  and  audio  for  technical  exchange.  For 
example.  Phone  is  a  free  application,  available  through  anonymous  FTP  which  allows  a 
workstation  to  become  a  telephone,  but  with  no  toll  charges.  The  Internet  talk  radio  is  a 
system  that  uses  the  Internet  as  a  radio  broadcast  channel.  CDROMs  are  also  used  to 
distribute  information  in  all  fields  of  research.  The  DEC  Alpha  workstation  has  an  ISDN 
port  which  can  be  used  to  access  a  new  worldwide  network  that  may  replace  the  Internet 
when  it  becomes  widely  available.  A  modern  networked  workstation  environment 
provides  a  user  with  access  to  the  types  of  software  depicted  in  the  diagram  below. 


Internet 
f  Gopher  and 
V  Archie 


Remote 

Model 

Access 


USER 


f  Multimedia  \ 
V  Development  J 


FTP  Model \ 
Databases  J 

Advanced  ^ 
Curriculum  J 
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2.3)  Object-Oriented  Models 


Object-oriented  systems  are  an  important  software  development  tool.  Their  main 
advantage  is  that  data  and  executable  functions  that  access  the  data  are  packaged  together 
in  objects.  Access  to  objects  is  performed  by  sending  messages  to  invoke  the  functions 
implemented  by  the  object  on  its  associated  data.  Objects  can  be  used  in  the  definition  of 
new  objects  through  a  mechanism  called  inheritance.  The  diagram  below  describes  the 
set  of  object  types  defined  through  inheritance,  starting  with  the  root  Model  object  type. 

All  objects  declared  by  a  programmer  would  normally  inherit  the  data  structures  and 
functions  associated  with  the  Model  object.  For  example  each  object  in  the  hierarchy 
could  inherit  a  model  type  data  element  and  a  function  which  could  be  used  to  identify  it 
as  Military,  Government,  Hospital,  etc.  The  figure  below  shows  a  class  hierarchy  for  a 
system  which  involves  sectors  of  our  society  that  may  be  subject  to  modeling  and 
simulation. 


Multiple  Inheritance  is  a  mechanism  that  allows  new  object  types  to  inherit  their 
definitions  from  objects  in  the  class  hierarchy  that  are  independently  defined  as  shown 
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in  the  diagram  below.  For  example,  a  model  involving  Hospitals  and  Military  would 
define  objects  that  inherit  their  definitions  from  objects  that  have  only,  perhaps,  the 
Model  type  in  common.  This  facility  allows  flexible  models  to  be  defined  in  specialized 
domains,  using  objects  of  any  type.  This  facility  can  be  used  to  take  advantage  of  the 
functions  that  have  been  developed  and  the  data  structures  that  have  been  designed  to 
support  reusability  and  interoperability.  It  also  promotes  the  development  of  component 
models  that  can  be  implemented  in  systems  providing  multiple  levels  of  fidelity,  to 
facilitate  the  construction  of  models  involving  component  types  that  are  already 
implemented.  The  system  designer  can  focus  on  the  problem  of  implementing  variable 
fidelity  with  less  concern  for  implementation  of  details  governing  object  behavior. 


Object-oriented  database  systems  [Chorafas  93]  provide  a  mechanism  for  persistent 
storage  and  access  of  objects.  They  support  definition  of  new  data  models  as  well  as  the 
evolution  of  existing  models  to  support  diverse  and  changing  storage  structure  and 
processing  requirements.  They  provide  a  natural  mechanism  for  integration  of  network 
communication,  multimedia  support  functions,  and  object-specific  processing,  including 
displaying,  storing  and  processing  images,  voice,  and  other  data. 

3)  MASTERMIND  -  The  Future  Environment 

MASTERMIND  can  take  advantage  of  increasingly  available  processing  resources  that  are 
being  developed  for  the  Internet.  Most  Internet  access  today  is  being  used  for  file 
transfer  and  for  simple  point-to-point  communication  channels,  usually  to  exchange 
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audio  or  video  data.  Distributed  services,  such  as  a  modelbase  management  system 
usually  require  a  "critical  mass"  of  users  before  they  are  widely  adopted.  MASTERMIND 
is  a  system  for  access  to  models  in  a  distributed  model  database.  It  will  provide  an 
incentive  for  developers  to  provide  component  models  that  implement  varying  degrees  of 
fidelity  and  make  it  easy  to  use  existing  component  models  in  the  construction  of  new 
models. 

Mastermind  will  allow  a  user  to  easily: 

•  Find  the  component  models  that  produces  the  best  results 

•  Find  the  input  parameters  that  would  produce  that  result 

•  Produce  a  model  for  a  required  degree  of  fidelity 

MASTERMIND  is  a  system  that  supports  a  generalized  notion  of  modeling  and  simulation. 

It  considers  most  computer-based  applications  to  be  models  of  "real"  systems  that  are 
used  to  provide  output  to  users  or  to  connect  directly  to  a  device  that  controls  a  real¬ 
time  process  based  on  predictions  or  decisions  that  are  generated  by  a  model.  A 
generalized  concept  of  query  processing  is  applied  to  a  "model  base"  or  collection  of 
models.  A  new  query  processing  paradigm  allows  selection  of  models,  based  on  their 
characteristic  attributes  and  based  on  the  results  of  model  execution,  given  a  set  of  input 
parameters.  It  also  allows  determination  of  appropriate  model  parameters,  given  a  set  of 
inputs  and  a  required  result. 

The  generalized  query  processing  paradigm  goes  beyond  the  most  advanced  concepts  in 
query  processing  associated  with  logic  programming  and  object-oriented  databases.  In 
addition  to  searching  the  database  for  specific  "matches"  to  query  inputs,  a  model 
database  management  system  (MDBMS)  is  expected  to  derive  parameters  from,  possibly 
continuous,  distributions  associated  with  model  parameters.  This  approach  allows  a 
user,  querying  the  modelbase  to  find  the  set  of  component  models  which,  when  integrated 
will  produce  an  output  that  is  closest  to  the  desired  output.  Models  are  often  used  to 
verify  expected  results  and  show  that  a  system  should  be  built  before  its  actual 
implementation.  Mastermind  will  also  serve  to  adjust  input  parameters  to  improve  the 
result  when  directed  to  do  so. 

In  the  long-term,  MASTERMIND  will  develop  into  a  facility  that  extends  beyond  anything 
that  is  currently  envisioned  by  experts  in  either  the  modeling  and  simulation  or  the 
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database  communities.  Advanced  MDBMS  query  processing  will  model  initialization 
parameters  for  input,  but  also  provide  for  specification  of  model  output  as  input  to  the 
query  processor.  The  system  will  then  execute  potentially  useful  combinations  of 
component  models  to  determine  which  composite  models  return  the  best  results. 

3.1)  MASTERMIND  Architecture: 

Sever  mechanism  and  functions  characterize  the  MASTERMIND  architecture: 

•  A  set  of  models  that  can  be  used  as  examples  in  coursework  at  all  levels.  A  model 
for  counting  sheep  may  help  a  first  grader  to  develop  mathematical  skills,  without 
knowing  how  to  actually  build  a  model.  A  graduate  researcher  could  use  MASTERMIND  to 
try  a  new  queuing  technique  in  a  model  implementation. 

•  Models  that  can  be  executed  by  researchers  without  concern  for  implementation 
details  along  with  functions  for  the  update  of  stored  models. 

•  A  mechanism  for  interfacing  models  which  provides  the  capability  to  build  new 
models  that  use  existing  models. 

•  A  facility  to  store  interface  "models"  that  allow  reuse  of  interfaces  designed  to 
connect  models. 

•  Access  to  information  about  ail  models  in  the  modelbase  including  technology  used 
to  develop  the  model  and  other  descriptive  information  that  can  be  used  by  users  at  all 
levels,  from  beginning  to  expert. 

•  A  general  query  capability  that  provides  access  to  all  information,  from  access  to 
model  descriptions  to  execution  of  appropriate  models  to  solve  a  problem. 

3.2)  MASTERMIND  Queries 

MASTERMIND  queries  provide  a  generalized  facility  for  building  models  from  existing 
models.  Given  a  model  description,  including  input  parameters,  initialization  data  and 
required  output,  the  system  automatically  builds  and  executes  a  model  that  best  solves 
the  problem,  given  existing  technoiogy.  The  system  can  also  report  on  other  possible 
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approaches  to  building  a  model  that  would  require  additional  interfaces  to  be  developed. 
MASTERMIND  provides  a  dynamic  querying  environment  based  on  model  execution,  in 
addition  to  providing  a  static  query  capability  that  would  allow  selection  of  models  or 
model  components  based  on  their  static  characteristics. 

The  MASTERMIND  approach  requires  development  of  new  query  processing  technology  as 
well  as  development  of  a  modelbase  to  support  the  system.  Instead  of  searching  for 
models  that  would  support  the  expected  types  of  input  and  output  parameters,  the  system 
searches  for  the  best  model  execution  to  produce  a  specific  set  of  output  parameters, 
given  specific  input. 

Models  in  the  database  can  be  designed  to  accept  variable  numbers  of  input  parameters. 
Variable  input  parameter  models  use  default  parameters  when  user  inputs  are  not 
specified.  For  example,  an  FI  6  model  may  produce  a  number  of  targets  capable  of  being 
acquired,  given  the  number  of  FI  6s  and  a  rate  of  target  acquisition  per  FI  6.  Specifying 
only  the  number  of  FI  6s  in  a  model  would  result  in  use  of  the  default  target  acquisition 
ratio.  Accessing  models  with  variable  numbers  of  parameters  is  supported  by  standard 
object-oriented  database  technology. 

When  component  model  input  and  output  parameters  match,  a  model  created  by 
pipelining  the  result  of  one  component  to  the  other  may  satisfy  user  requirements.  At  the 
other  extreme,  a  user  finding  that  there  are  no  models  in  the  database  that  satisfy  a 
problem  statement  would  be  required  to  build  component  models  and  implement  their 
interface.  We  believe  that  many  user  requirements  could  be  satisfied  by  developing  new 
interfaces  to  existing  models.  But,  for  this  approach  to  be  successful,  it  is  necessary  that 
interface  standards  for  modeling  and  simulation  be  developed,  in  order  to  avoid  an 
exponential  explosion  of  interface  modules. 

MASTERMIND  will  encourage  development  of  reusable  model  components.  Many  of  the 
models  that  have  been  developed  incorporate  internal  implementations  of  submodels  that 
could  be  made  publicly  available  if  a  system  such  as  MASTERMIND  were  available. 

3.3  Processing  Requirement  for  Modeibase  Queries 

MASTERMIND  processing  requirements  will  address  many  of  the  issues  that  are  common 
for  distributed  database  systems.  In  developing  techniques  to  support  the  dynamic  model 
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execution  paradigm  for  query  processing,  the  system  must  use  techniques  to  maximize 
parallelism  by  predict  results  which  can  allow  dependent  stages  of  processing  to  be 
precomputed  based  on  projected  outcomes  while  producing  reasonable  approximations. 
Many  of  the  issues  that  are  raised  in  the  area  of  query  decomposition  strategies  appear  to 
be  similar  to  those  that  are  addressed  in  [Spetka  92].  Generalized  procedures  for  query 
decomposition  allow  component  models  to  be  processed  in  arbitrary  order. 

MASTERMIND  query  optimization  will  take  into  account; 

•  Need  to  execute  queries  to  determine  when  to  select  a  model 

•  Need  for  greater  parallelism  and  increased  computing  power 

•  Increased  opportunities  for  parallelism  but  reduced  pipelining 

3.4)  Comparison  to  RDBMS  Query  Processing 

Mastermind  queries  are  significantly  different  from  relational  database  queries.  As  the 
diagrams  below  show,  in  an  RDBMS,  the  database  scheme  is  defined  and  a  database 
instance  is  installed.  Queries  are  then  submitted  to  search  the  database  to  look  for 
existing  relationships.  Simple  text  comparison  is  used  to  find  matching  entries  from 
tables  involved  in  a  query  to  produce  a  result.  Indexing  techniques  can  be  applied  to  make 
processing  very  efficient.  In  the  example  shown  below,  a  match  of  entries  in  the  tables 
can  easily  produce  a  listing  of  names  of  courses  taught  by  a  particular  professor. 


CS300 

Fortran 

CS430 

Modeling 

CS500 

Queueing 

CS600 

Seminar 

Smith 

CS430 

Spetka 

CS430 

Spetka 

CS500 

Thomas 

CS430 

Professor  Relation  Course  Relation 
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In  a  model  base,  it  is  not  the  end  result  of  the  model  that  is  important  but  the  models  used 
to  produce  the  result.  This  suits  most  classical  modeling  needs  since  it  allows  production 
of  a  model  that  can  best  produce  a  result.  Query  input  parameters  define  the  models  that 
are  accessed  by  the  query  to  dynamically  configure  the  component  model  to  find  the  best 
component  attributes  or  to  meet  the  specification  of  a  problem. 

In  addition,  a  modelbase  query  has  to  express  the  processes  that  are  needed  by  the  model. 
It  may  be  possible  for  two  completely  independent  sets  of  detailed  models  to  produce  the 
same  output.  In  the  diagram  below.  Models  1  and  2  can  be  used  to  produce  a  required 
output.  The  diagram  also  shows  that  the  query  also  produces  2  other  distinct  models  of 
higher  fidelity,  involving  component  models  3  and  4.  Models  3  and  4  could  be  alternative 
implementations  for  the  same  functional  model.  Note  that  a  search  of  the  modelbase  could 
be  restricted  to  include  only  model  types  1  and  2  if  higher  fidelity  models  are  not 
desired. 


4)  A  Community  Research  Network  Resource 

The  Rome  Laboratory  Modeling  and  Simulation  Community  Resource  Network  Resource 
(CRNR)  addresses  the  need  to  improve  interaction  of  researchers  and  educators  in  all 
sectors  of  the  modeling  and  simulation  community;  government,  university  and  industry. 
The  vision  of  a  CRNR  calls  for  collaboration  in  the  community  to  advance  ail  critical 
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technologies  through  effective  use  of  modeling  and  simulation.  The  CRNR  will  evolve 
through  contributions  from  community  members  in  both  software  and  expertise. 

The  CRNR  is  designed  as  a  conduit  for  the  flow  of  expertise  and  knowledge  to  benefit  all 
members  of  the  community.  It  is  a  flexible  mechanism  designed  to  support  the  needs  of 
top  experts  in  the  field  and  novices  struggling  to  build  their  first  model. 

The  CRNR  provides  a  layer  of  multimedia  tools  for  intelligent  access  to  modeling 
resources  through  the  CRNR  network.  The  CRNR  provides  broad  access  functionality  to 
support  cooperative  development  of  models  by  researchers  distributed  throughout 
industry,  university  and  government  laboratories.  In  addition  to  providing  the  basic 
tools  needed  by  experts  to  work  in  a  specific  problem  area,  the  system  provides 
mechanisms  to  facilitate  education  and  research  in  modeling  and  simulation  at  all  levels 
within  the  modeling  and  simulation  community. 

4.1)  Modeling  Simulation  and  Analysis  Research  Laboratory 

The  Modeling  Simulation  and  Analysis  Research  Laboratory  (MSARL)  will  serve  as  the 
hub  of  the  CRNR.  A  laboratory  is  needed  in  the  CRNR  to  provide  resources  for  remote 
model  execution,  for  example  to  support  MASTERMIND,  and  to  encourage  the  use  of 
object-oriented  modeling  techniques  that  will  foster  interoperability,  reusability  and 
variable  fidelity.  Researchers  from  all  sectors  of  the  community  will  be  anxious  to 
provide  component  models  that  are  able  to  interoperate  with  existing  models  to  help 
define  interfaces.  Interfaces  will  evolve  to  become  de  facto  standards  when  they  are 
accepted  as  the  premier  interface  for  a  heavily  used  component.  Object-oriented 
technology  will  of  course  provide  for  changes  in  the  implementation  of  a  component 
model  after  its  interface  is  established.  Component  models  that  are  inefficient  or  provide 
an  inconvenient  interface  may  be  dismissed  for  neglect.  It  is  essential  that  the 
government  support  the  initial  development  of  the  CRNR  to  encourage  development  of 
facilities  that  extend  into  the  community,  in  addition  to  MSARL  The  diagram  below  shows 
MSARL  playing  the  role  of  the  hub  in  the  CRNR. 


26-14 


r 


4.2)  Anonymous  FTP  Model  Description  Database 

For  simple  inquiries  regarding  availability  of  models  with  particular  characteristics, 

Internet  access  should  prove  acceptable.  In  fact,  a  system  to  support  this  level  of  model 
access  has  been  implemented  at  Rome  Laboratory  as  a  first  step  in  CRNR  development. 
The  mechanism  uses  anonymous  FTP  for  access  to  descriptions  of  Rome  Laboratory 
models  and  simulations.  Other  interfaces,  such  as  Gopher  and  NFS  mounting  can  be  used 
to  "export"  publicly  available  information  to  the  modeling  and  simulation  community  in 
electronic  form.  NFS  can  make  remote  files  appear  as  if  they  were  part  of  a  local  disk.  In 
addition  to  model  descriptions,  technical  papers  and  discussion  of  technical  issues  can  be 
shared  through  USENET  News  service  and  other  developing  systems. 

4.3)  Distributed  Classroom 

The  CRNR  will  bring  together  experts  from  the  modeling  and  simulation  community. 
Videoconferencing  is  an  important  technology  for  experts  to  develop  new  ideas  and  to 
interact  with  students  at  all  levels  of  study.  The  nature  of  the  CRNR  demands  a  flexible 
network  configuration.  Experts  will  at  times  require  connection  to  many  groups  of 
"students".  However,  experts  may  become  students  in  some  cases  to  learn  from  other 
experts.  The  diagram  below  shows  a  possible  configuration  for  a  videoconference  in 
support  of  a  structured  distributed  classroom. 
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The  CRNR  will  be  used  experimentally  in  a  structured  classroom  environment  at  SUNY 
Institute  of  Technology  at  Utica/Rome,  CSC555,  a  course  in  Models  and  Metrics  for 
Computer  System  Performance  Evaluation  will  be  offered  in  Spring  1 994  to  test  the 
capabilities  of  the  developing  CRNR  and  take  advantage  of  initial  tools  and  information 
resources  provided  by  the  Rome  Laboratory  MSARL.  In  addition,  the  expertise  of  the 
Office  of  Modeling  and  Simulation  will  contribute  technical  support  in  the  development 
and  offering  of  the  course.  Initial  plans  for  the  collaborative  effort  began  during  the 
period  of  the  Summer  Research  Program. 

Initial  CRNR  contributions  to  the  distributed  classroom: 


•  Network  access  to  MSARL  resources 

•  Experts  to  participate  in  classroom 

•  Develop  resources  for  use  in  classroom 

•  Develop  curriculum  for  CSC555 

•  Introduction  to  modeling  and  simulation  concepts  -  Stella® 
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•  Modeling  Tools  and  Language  - 

CACI:  Network  II,  Simscript,  Modsim  ll,Mathematica 

4.4)  A  Remote  Workstation  Server  Interface 

It  is  important  that  as  many  members  of  the  modeling  and  simulation  community  as 
possible  be  networked  into  the  CRNR,  to  provide  momentum  and  encourage  the  sharing  of 
ideas  that  will  drive  the  development  of  the  CRNR.  Early  in  the  project  an  inexpensive 
interface  system  will  be  developed  to  give  access  to  members  who  cannot  afford  a 
machine  that  will  be  typical  of  the  CRNR  model  execution  sites,  like  MSARL  The  system 
must  provide  a  graphical  interface  and  be  capable  of  supporting  the  remote  model  access 
protocols  initially  developed  for  the  CRNR.  A  system  is  being  developed  that  will 
demonstrate  the  feasibility  of  an  initial  remote  workstation  server  on  an  Intel 
80386/DX  platform  running  the  386BSD  operating  system.  The  software  developed  is 
expected  to  be  portable  to  the  next  generation  of  inexpensive  platforms,  based  on  the 
Intel  Pentium  chip. 

4.5)  ISDN  Network 

The  communication  pathways  needed  by  the  system  are  as  diverse  as  the  mechanisms 
they  support.  A  fully  functional  CRNR  will  require  flexibility  not  available  through  the 
Internet,  for  videoconferencing  and  distributed  modeling  environments.  A  major 
problem  that  must  be  addressed  is  the  ability  to  provide  sufficient  communication 
bandwidth  to  meet  these  requirement.  Integrated  services  digital  network  (ISDN) 

[Smouts  91]  is  a  network  being  developed  around  the  world  which  will  provide  the 
improved  services  needed  to  support  the  CRNR  proposal.  The  diagram  below  shows  the 
access  paths  for  component  models  through  the  ISDN  network  to  support  an 
implementation  of  MASTERMIND. 
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4.6)  A  CRNR  Decision  Aid 

Interoperability  of  models  and  model  construction  from  reusable  components  requires 
understanding  relationships  between  models.  In  addition  to  MASTERMIND,  interactive, 
graphical  user  interface  (GUI)  tools  must  be  given  to  system  users  to  facilitate  a  search 
process  for  component  models  that  might  interoperate  to  produce  a  new  model.  An 
interactive  GUI  interface  would  help  to  identify  opportunities  to  exploit  existing  models 
when  no  apparent  useful  interfaces  can  be  automatically  located.  The  interactive  decision 
aid  can  identify  modules  which  might  be  useful  if  fuzzy  logic  techniques  can  be  applied  to 
adapt  interfaces  to  conform  to  existing  interfaces  on  other  models.  A  prototype  decision 
aid  was  built  over  the  summer  in  the  Rome  Laboratory  Office  of  Modeling  and  Simulation 
[Dobransky  93]. 

5)  CONCLUSION 

The  Summer  Research  Program  lead  to  a  new  concept  in  query  processing  and  an 
integrated,  evolutionary  approach  to  solving  the  problems  of  interoperability, 
reusability  and  variable  fidelity.  The  concepts  that  developed  were  the  result  of  expert 
leadership  in  the  Office  of  Modeling  and  Simulation  and  exposure  to  a  broad  range  of 
modeling  efforts  both  in  Rome  Laboratory  and  at  other  laboratories  in  the  government, 
private  industry  and  academics.  In  addition  to  exploring  technical  articles  and  books  in 
the  modeling  and  simulation  field,  the  Summer  Research  Program  implemented  a 
microcosm  of  the  Community  Research  Network  Resource  which  we  sought  to  implement. 
Our  efforts  validated  many  of  our  hypothesis  regarding  the  need  for  cooperation  and 
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sharing  to  provide  advances  in  modeling  and  simulation  and  provide  an  environment  in 
which  researchers  will  continue  to  flourish  and  maintain  technical  leadership  in  the 
area.  The  paper  concludes  by  presenting  the  research  and  development  issues  which 
remain  to  be  addressed  to  make  the  CRNR  vision  become  a  reality. 

CRNR  Development: 

•  Interaction  with  modeling  and  simulation  community 

•  Development  of  on-line  model  standards  for  CRNR  access 

In  addition  to  running  a  model,  expert  information  about 
the  model  is  needed.  Internet  protocol  for  model  access. 

•  Develop  ISDN  access  to  CRNR. 

•  Monitor  community  interest  in  RL  models  through  the  FTP  database. 

•  Develop  graduate  course  to  utilize  CRNR. 

•  Develop  query  processing  techniques  for  CRNR  access,  including  decision  aid 

and  MASTERMIND. 

•  Develop  distributed  simulation  techniques 

Research  Areas: 

Parameter  conversion  functions  and  model  interfaces  will  require  research  in  the 
following  areas: 

•  Fuzzy  Logic 

\ 

•  Knowledge-Based  Rules 

•  Neural  Nets/Petri-Nets 

•  Object-Oriented  Systems  -  Operator  Overloading/Polymorphism 
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